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to at least 70% of the base sequence of bases 
38538-54123 of SEQ ID N0.27, and which functions to 
restore fertility, is used. 
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Description 

FIELD OF THE INVENTION 



^le^ the rice ferti,*, and discerning the 

[0002] The present applicatton claims Drio L ZV«T? 9 D BT type W 88 "^ male sterility, 
tember 19, 20 01 , Japanese Pate T PP ,^^ K^^^? ^ 2 ° 01 ' 285247 ,iled °" Ap- 
plication No. 2002-185709 filed on June 26 2002 The Lr^l ^1°^ 4 ' 2 °° 1 and Japanese Patent Ap- 
rated herein. ' 002 ' The 6nt,re dlsclosur es of the three patent applications are incorpo- 



PRIOR ART 



P°''-sfromtheparentvarietywithwh^^ is eff -ted with 

for producing a large quantity of hybrid seeds on a commerclTscJie 9 ""'^^ ,sentire| y u ^*able 

of the sterile line with the poVen of the SZSlThlZ Si S ^ 0btained ,hr ° U9h fertili2ation 

with the poilen of the maintains line 688 00 the Ster " e " ne Can be maintained through its fertili 2a tion 

ofl^iKS 

gene and thatthe Rf-1 gene is ho^B^Xa.^ ^ ^ * ^i" 9 maintains Rf " 1 

to check to ensure that the variety usels the res rer ^ the triple-crossing system 

000* ° rde ; t0 9nSUre th3t the reSU ' tin9 h * brid seeds Si* Pr6SenCe ° f Rf " 

by investigating the incidence of ind ivTd ais that car oroducf^Jl , and the " M ^ flowed 

70-80% or more). The standard line re Ttoe ma'inta ner e h! i TT^ h ' 9her than 3 C6rtain ,evel <•* 
appropriately chosen depending upon whether ££^J£mT^ ! " ' ° f ^ tW ° ' ineS > and il is 
or unknown. If the standard line is a sterile I ne t is cTssedTo TXITT ** " °' BT *»» ° r n ° rmal 'VP- 
standard iine is a maintainer line, it is crossed J as the ^male parent " ^ f6ma ' e Parent and if the 

Efo^e^^^ 

environment such as cold climate or insufficie !SS t k ,nv ^tigat.on is made in an unfavorable 

tution, with the result that genotyping tfh XXw} T lTO8pec ^ ° f the 9enot VP e co ^i- 

[00081 With a view to Qniwinn thll u, 9 annot be P erfo Tned accurately. 

LebyaTchn!^^ 

absence of Rf-1 gene by dete cting base sequences " nkecfhf Rf TZTTT * CheCking f ° r the presence or 
as DNA markers). Note that it is not poJST^S^V^T TUST ™ ^ t0 

not been clarified so far. V RM 9ene s,nce tne DNA sequence of Rf-1 gene has 

[0009] For example, it has been reported that the locus of Rf-1 aene in rico k nr D =ont u 

between DNA marker (RFLP marker) loci G291 and G1?7 2 Z k L P . ° P chromos °mo 10 and located 
Phism analysis (RFLP) (Fukuta et ^^^"^^^ ^ 
the locus of Rf-1 gene by investigating the genotypes Zori^M^JX?* kT m *° W " ° f 9en ° typin 9 
[0010] However, the conventional mole^^^^ 

which need to be detected by Southern blot analvsi Tin ordl Z = ! problems - First - the V ^se RFLP markers 
«eve, needs to be purified from the JXLt h ~ h ^ ™A at the microgram 
steps compnsing treatment with restriction enzymes electroohor^k Z'r!» u u Carry ° Ut 3 se W e ™ of 

detection; this not only involves considerable ^bJ^S^^Z' h * b !f za «™ with a P«*e and signai 
[001 1] The second problem is that since the aene ml S/ ,! * * W96k t0 ° btain the test results 
long as about 30 cM (correspond ng'o about 9000 kbo TS^ST ^ M G291 a " d G127 is as 
recombination in the region wou.d be a «* ,he 0Ccurrence " double 

locus of Rf-1 gene can be estimated correct^ by the markers ^ 9Uaranteed ,ha < the ^type of the 

[0012] Third,, whenthepresenceof Rf-1 gene is estimated by detecting RFLP marker loci G291 and G127, not only 



> 



EP 1 437 409 A1 

S1S1B7/10g7),bolhor^«| l ,Jd l ^ l ^^^J^ ( l apaneSe Patent Publi ^ Disclosure No. 
their genetic distances from Rf-1 gene resp^ i ve y bein i 1 % ± 0 7 cM < ^ T * the '° CUS ° f RM ^e, 
and 3,7 ± 1.1 cM (corresponding to about 1110 2 In ice nil LZ ( 'TT"*" 9 '° 3b ° Ut 480 kbp in rice DNA > 
RM gene. Hence, the presence of Rf-1 gene can be TstZ^ Z r, ?" ° C ' be '" 9 ° n ° PP ° Site sides of ,he ,ocu * ° 
dominant PCRmarker and that ^ciJ^^S^^SS 0f both the of the 

us to ti te as t0 wnether the locus of m TtS^^ZST^ PCR marker a,so enables 

the locus of the co-dominant marker a d haTof "r m « ^ eroz ^ us - However, if recombination occurs between 
correctly, particularly as to whether Tha£^Z££Z *™T ° RM 9e " e '° CUS cannot be de ^n<* 
used to genotype the locus of Rf-1 gene, ^SZ^JSSSSZ i , ^ ^ ' d ° minant marker is 
are homozygous (Rf-1/RM) or hetL^s StS^^?™!?^ irreSpeCtive °' Aether they 

markeris used in combination with the dominant Z^lTo^TlT 9 T\ 6Ven ifth « co-dominant 

to correctly distinguish individuals having Rf-1 ^S^^J^^. "*? ° f RM 9ene > ■ is " ot P-^ 
if no amplification product is obtained in PCR 7sing the ^^^ZZT^^J*" F ^her, 
due to some problems in the experimental procedure As ZtoeZ*lZ -^T *"* the P ° SSM ^ that this is 
co-dominant marker and the dominant marker is as urea. as abouTsT^T' !f< diSt8nCe betWeen the 

sornal region introduced from the Rf-1 gene donor parent cannot b « ^ * the Chr ° m °- 

conc i t nt jntrod of a de)ete J s gene J*™™ J ^ ^o^y small value to prevent any 

[001 5] Japanese Patent Public Disclosure No 1 39465/2nnn H a =liK ! 9 

oped on the basis of the base sequences of RFLP m^rkers^ocateri^n P ° R markerS that were dev *'- 

1 0 of rice. However, most of those PCR ^7e)s^olV^ Tl " e ' 9hborhood ° f W °n chromosome 
about 1 cM. * 6rS are spaced from tne Rf " 1 by a genetic distance greater than 



SUMMARY OF THE INVENTION! 



^Z^^l^l T^^TZTT reSt ° rin9 riCS f6rti,ity - A method °< present 
N0.27, or has a base sequence J^^t^l^ ^."2? ^ "» bBM S ^°™ °< « ID 
functions to restore fertility. Another method of tte^Z£^ SeqUe " Ce ° f SE ° ,D N °' 27 ' and which 
wherein the nucleic acid has the base se^u^^^Z^T^ I"" 9 * nUC ' eiC acid int0 rice - 
which Is identical to at least 70% of the base seauenrfofh! o«J D SEQ ID N0 27 ' or nas a bas * sequence 
to restore fertility. Still another *^^£££^J^™ ° f SEQ '° N0 27 > and which functions 
the nucleic acid has the base sequence o ba^S ^^ e "^ compnses mtroducing a nucleic acid into rice, wherein 
27, or has a base sequence whL Is ZZno T ^/ 717^ 42132 " 48883 ° f SEQ ID N0 ' 
preferably, bases 42132-48883 of SEQ ID NO 27 andlS 1 r , ° f baS6S 423 ^-53743, more 

methods of the present invention, a base sequence ZS^^ST ? ^ a " emb ° d '™nt of the 

1 ) and 2): 1 ^ J 0T bhQ 10 N0 27 mee ts at least one of the following requirements 

1) a base corresponding to the base 45461 of SEQ ID NO 27 is A- and 

2) a base corresponding to the base 49609 of SEQ ID N0.27 is A.' 

that a sequence determining the presenc o ll Z^ n J* * ° f the present inven,ion ^ a fact 

the polymorphism detection ^nm^^J^S^Z^- 9 ™ ^ ^ P ° Siti ° ns between 
[001 8] In an embodiment of the methods of thi n , ' ° n " Ce chrom osome 1 0. 

determined to have the Rf-1 g^I^^SSa' ha^ ^ indMdUa ' » ,he Seed *«* * 
least 70% of the base sequence of SEQ ID NO 27 o of the Se l n ^ 9 ^ Wh ' Ch is identical to * 

w.* f or of the base sequence of bases 38538-54123 of SEQ ID N0.27, 
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meets at least one of the following requirements 1) and 2): 

1) a base corresponding to the base 45461 of SEQ ID No.27 is A; and 

2) a base corresponding to the base 49609 of SEQ ID NO.27 is A. 

[0019] Another object of the present invention is to provide a method for inhibiting the function of the Rf-1 gene to 
restore fertility. The inhibition method of the present invention comprises, in an embodiment, introducing an antisense 
having at least 100 continuous bases in length, and having a base sequence complementary to a nucleic acid having 
the base sequence of SEQ ID NO.27, or to a nucleic acid having a base sequence which is identical to at least 70% 
of the base sequence of SEQ ID NO.27, and which functions to restore fertility. In another embodiment, the inhibition 
methods of the present invention comprise introducing an antisense having at least 100 continuous bases in length, 
and having a base sequence complementary to a nucleic acid having the base sequence of bases 38538-54123 of 
SEQ ID NO.27, or to a nucleic acid having a base sequence which is identical to at least 70% of the base sequence 
of bases 38538-541 23 of SEQ ID NO.27, and which functions to restore fertility. 

[0020] Another object of the present invention is to provide a nucleic acid having the base sequence of SEQ ID NO. 
27, or a nucleic acid having a base sequence which is identical to at least 70% of the base sequence of SEQ ID NO. 
27, and which functions to restore fertility. The present invention also provides a nucleic acid having the base sequence 
of bases 38538-54123 of SEQ ID NO.27, or a nuclei acid having a base sequence which is identical to at least 70% 
of the base sequence of bases 38538-54123 of SEQ ID NO.27, and which functions to restore fertility. The present 
invention also provides a nucleic acid having the base sequence of bases 42357-53743, more preferably, bases 
42132-48883 of SEQ ID NO.27, or a nucleic acid having a base sequence which is identical to at least 70% of the 
base sequence of bases 42357-53743, more preferably, bases 42132-48883 of SEQ ID NO.27, and which functions 
to restore fertility. 

BRIEF DESCRIPTION OF THE DRAWING 
[0021] 

Fig. 1 shows the results of chromosomal walking started from the RFLP marker locus S12564. 
Fig. 2 shows an alignment of lambda clone contigs in relation to the BAG clone AC068923. 
Fig. 3 shows the chromosomal organization of recombinant pollens proximal to the Rf-1 locus (all fertile) as mapped 
in close proximity to the Rf-1 locus based on the genotypes at the marker loci of 1 0 individuals (RS1 , RS2, RC1 -8) 
generated from the pollens. White bars represent japonica regions and black bars represent indica regions. 
Fig. 4 is a gene map in which the locus of Rf-1 gene on chromosome 10 of rice is positioned on a linkage map in 
relation to various markers; the values of map distance were calculated from the segregation data from 1042 F1 
individuals. 

Fig. 5 shows fragments from 1 0 genomic clones used for the identification of the Rf-1 region by complementation 
assays. Lambda clones obtained by chromosomal walking (thin lines) were used for complementation assays of 
the chromosomal regions shown by bold lines. XSF18 was found to contain a deletion shown by dotted line. 
Fig. 6 shows the results of complementation assays using a 15.7 kb fragment from XSG16 (Example 10) and a 
16.2 kb fragment from XSF18 (Example 8). The plant transformed with the 15.7 kb fragment from XSG16 has 
restored fertility as proved by ears bowing. 

BEST MODES FOR PERFORMING THE INVENTION 

[0022] We began by restricting the Rf-1 locus to a very small region on chromosome 1 0. On this basis, we developed 
PCR markers proximal to the Rf-1 locus and found a method for detecting the RM gene by utilizing on the linkage of 
these PCR markers to the Rf-1 locus. Specifically, the presence of the Rf-1 gene is tested and individuals homozygous 
for the Rf-1 gene are selected by genotyping at the novel PCR marker loci proximal to the Rf-1 locus on the basis that 
the Rf-1 locus is mapped between the PCR marker loci S12564 Tsp509l and C1361 Mwol on chromosome 1 0 of rice. 
We previously filed a patent application for the method for detecting the Rf-1 gene under Japanese Patent Application 
No. 2000-247204 on August 17, 2000. The entire disclosure of the patent application is incorporated herein by refer- 
ence. 

I. Methods for estimating the genotype at the Rf-1 locus described in Japanese Patent Application No. 2000-247204 

[0023] Japanese Patent Application No. 2000-247204 describes methods for determining whether or not a rice in- 
dividual or seed under test has the Rf-1 gene on the basis that the Rf-1 locus is mapped between the PCR marker loci 
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S12564 Tsp509l and C1361 Mwol on chromosome 10 of rice. 
Markers 

[0024] Primer pairs designed to be specific to particular regions near the locus of Rf-1 gene are used in PCR and 
the amplification products are treated with particular restriction enzymes; upon electrophoresis, rice of indica lines in 
some cases provide an observable band of a different size from that of rice of Japonica lines. This band which is 
characteristic of indica lines is herein referred to as the Rf-1 linked band. Now that it has been made clear by the 
present inventors that the locus of Rf-1 gene is located between PCR markers S12564 Tsp509l and C1361 Mwol on 
chromosome 1 0 of rice, the skilled artisan can appropriately develop and employ PCR markers that are present in the 
neighborhood of Rf-1 gene. 

[0025] For instance, according to the invention, a rice individual under test is checked to see if its genome contains 
at least one of the PCR markers listed below, thereby determining whether the individual under test has Rf-1 gene 
linked to those PCR markers: 

(1) marker 1: PCR marker R1877 EcoRI which, when rice genomic DNA is subjected to PCR with DNA primers 
having the sequences of SEQ ID NO:1 and SEQ ID NO:2, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the amplification products have a recognition site for restriction 
enzyme EcoRI; 

(2) marker 2: PCR marker G4003 Hindlll (SEQ ID NO:19) which, when rice genomic DNA is subjected to PCR 
with DNA primers having the sequences of SEQ ID NO:3 and SEQ ID NO:4, can detect polymorphisms between 
rice individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Hindlll; 

(3) marker 3: PCR marker C1361 Mwol (SEQ ID NO:20) which, when rice genomic DNA is subjected to PCR 
employing DNA primers having the sequences of SEQ ID NO:5 and SEQ ID NO:6, can detect polymorphisms 
between rice individuals of the japonica and indica lines depending on whether the amplification products have a 
recognition site for restriction enzyme Mwol; 

(4) marker 4: PCR marker G2155 Mwol (SEQ ID NO:21) which, when rice genomic DNA is subjected to PCR with 
DNA primers having the sequences of SEQ ID NO:7 and SEQ ID NO:8, can detect polymorphisms between rice 
individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Mwol; 

(5) marker 5: PCR marker G291 Mspl (SEQ ID NO:22) which, when rice genomic DNA is subjected to PCR with 
DNA primers having the sequences of SEQ ID NO:9 and SEQ ID NO:1 0, can detect polymorphisms between rice 
individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Mspl; 

(6) marker 6: PCR marker R2303 Bsll (SEQ ID NO:23) which, when rice genomic DNA is subjected to PCR with 
DNA primers having the sequences of SEQ ID NO:1 1 and SEQ ID NO: 1 2, can detect polymorphisms between rice 
individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Bs1 1; 

(7) marker 7: PCR marker S10019 BstUI (SEQ ID NO:24) which, when rice genomic DNA is subjected to PCR 
with DNA primers having the sequences of SEQ ID NO:1 3 and SEQ ID NO:1 4, can detect polymorphisms between 
rice individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme BstUI; 

(8) marker 8: PCR marker S1 0602 Kpnl (SEQ ID NO:25) which, when rice genomic DNA is subjected to PCR with 
DNA primers having the sequences of SEQ ID NO:15 and SEQ ID NO:16, can detect polymorphisms between 
rice individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Kpnl; and 

(9) marker 9: PCR marker S12564 Tsp509l (SEQ ID NO:26) which, when rice genomic DNA is subjected to PCR 
with DNA primers having the sequences of SEQ ID NO:1 7 and SEQ ID NO:1 8, can detect polymorphisms between 
rice individuals of the japonica and indica lines depending on whether the amplification products have a recognition 
site for restriction enzyme Tsp509l. 

[0026] Assuming that the locus of Rf-1 gene was highly likely to be located near the nine RFLP marker regions 
R1 877, G291 , R2303, S1 2564, C1 361 , S1 001 9, G4003, S1 0602 and G21 55 on chromosome 1 0 of rice (see the results 
of RFLP linkage analysis described in Fukuta et al., 1992, Jpn. J. Breed. 42 (supl. 1) 164-165 and the RFLP linkage 
map of rice described in Harushima et al., 1 998, Genetics, 1 48, 479-494), the present inventors converted those RFLP 
markers to co-dominant PCR markers such as CAPS markers or dCAPS markers as described below in Reference 
example 1 (Michaels and Amasino, 1998, The Plant Journal, 14(3), 381-385; Neff et al., 1998, The Plant Journal, 14 
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(3), 387-392). As a result of this conversion, the PCR markers above have been obtained. 

[0027] Among these PCR markers, one group consisting of PCR markers R1877 EcoRI, G291 Mspl (SEQ ID NO: 
22), R2303 Bsll (SEQ ID NO:23) and S12564Tsp509l (SEQ ID NO:26) and the other group consisting of PCR markers 
C1361 Mwol (SEQ ID NO:20), S10019 BstUI (SEQ ID NO:24), G4003 Hindlll (SEQ ID NO:19), S10602 Kpnl (SEQ ID 

5 NO:25) and G2155 Mwol (SEQ ID NO:21) are on opposite sides of the locus of RM gene on chromosome 10 of rice. 
[0028] Therefore, in one embodiment, the presence of the Rf-1 gene is detected by detecting RM linked bands by 
(a) at least one PCR marker selected from the group consisting of PCR markers R1877 EcoRI, G291 Mspl, R2303 
Bsll and S1 2564 Tsp509l, and (b) at least one PCR marker selected from the group consisting of PCR markers C1 361 
Mwol, S1 001 9 BstUI, G4003 Hindlll, S10602 Kpnl and G2155 Mwol. In this case, at least S12564Tsp509l from group 

10 (a) and at least C1361 Mwol from group (b) are preferably used as the closest PCR markers to the Rf-1 gene. If RM 
linked bands are detected with PCR markers of both (a) and (b) in the genome of the rice under test, it can be estimated 
with a high probability that the rice contains RM gene. 

[0029] In another embodiment, RM linked bands are detected by at least two PCR markers of group (a) and at least 
two PCR markers of group (b) above. For example, a rice individual carrying the RM gene with a minimum of unwanted 

15 gene regions can be selected by picking up an individual in which RM linked bands are detected by markers of groups 
(a) and (b) more proximal to the RM gene but not detected by markers of groups (a) and (b) more distal from the Rf- 
1 gene on the gene map shown in Fig. 1 . Again, it is preferred that at least one PCR marker of group (a) is S12564 
Tsp509l and at least one PCR marker of group (b) is C1361 Mwol. Thus, the two PCR marker loci S12564 Tsp509l 
and C 1 361 Mwol are separated by a genetic distance of 0.3 cM. By utilizing this characteristic, the chromosomal region 

20 that is introduced from the Rf-1 gene donor parent can be narrowed down to a size of about 1 cM. This helps minimize 
the possibility of introducing into the restorer line a deleterious gene that may be present in the neighborhood of RM 
gene in the donor parent. 

Detection of the Rf-1 gene 

25 

[0030] In order to detect RM gene in the genome of a rice under test, any one of the above PCR markers is amplified 
from the genome of the rice by PCR using primers of SEQ ID NOS: 1 -18 above and then detected by the polymerase 
chain reaction-restriction fragment length polymorphism method (PCR-RFLP). PCR-RFLP is a method that is applica- 
ble to the case where polymorphisms exist among variety lines at recognition sites of restriction enzymes in the se- 
30 quences of PCR amplified DNA fragments and by which specific polymorphisms can conveniently be identified on the 
basis of cleavage patterns with those restriction enzymes (D.E. Harry et al., Theor. Appl. Genet. (1998), 97:327-336) 
[0031 ] Restriction enzyme cleavage patterns show the bands as shown in Table 1 below on a visualized gel depend- 
ing on the primer pair used. 

35 Table 1 





Approximate size (bp) of detected band 


Detection of marker 1 (R1877 EcoRI) with primer pair 1 


When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


1500 and 1700 
1500, 1700 and 3200 
3200 


Detection of marker 2 (G4003 Hindlll) with primer pair 2 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has RM gene heterozygously 
When the genome of test rice has no Rf-1 gene 


362 

95, 267 and 362 

95 and 267 ' 


Detection of marker 3 (C1361 Mwol) with primer pair 3 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has RM gene heterozygously 
When the genome of test rice has no Rf-1 gene 


50 and 107 
25,50, 79 and 107 
25, 50 and 79 


Detection of marker 4 (G2155 Mwol) with primer pair 4 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


25, 27 and 78 
25, 27, 78 and 105 
25 and 105 
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Table 1 (continued) 







Detection of marker 5 (G291 Mspl) with primer pair 5 


When the genome of test rice has RM gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


25, 49 and 55 
25, 49, 55 and 104 

95 anH 1 04 


Detection of marker 6 (R2303 Bsll) with primer pair 6 




When the genome of test rice has RM gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


238, 655 and 679 
238, 655, 679 and 1334 


Detection of marker 7 (S10019 BstUI) with primer pair 7 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


130, 218 and 244 
130, 218, 244 and 462 

1 anrl /tfiO 
I OU dflU HOc. 


Detection of marker 8 (S10602 Konh with orimer Dair 8 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has Rf-1 gene heterozygously 
When the genome of test rice has no Rf-1 gene 


724 

117, 607 and 724 
117 and 607 


Detection of marker 9 (S12564 Tsp509l) with primer pair 9 




When the genome of test rice has Rf-1 gene homozygously 
When the genome of test rice has RM gene heterozygously 
When the genome of test rice has no Rf-1 gene 


41 and 117 
26,41,91 and 117 
26. 41 and 91 



II. Identification of the Rf-1 locus 

[0032] As described above, Japanese Patent Application No. 2000-247204 discloses RFLP-PCR markers based on 
our finding that the Rf-1 locus is mapped between DNA marker loci S 1 2564 Tsp509l and C 1 36 1 Mwol . Fertility-restoring 
lines are established by backcrossing the RM gene into a normal japonica variety not containing the RM gene. If the 
method for identifying the Rf-1 locus described in Japanese Patent Application No. 2000-247204 is used during this 
process, not only the restoring lines can be established efficiently (within 2-3 years) but also the length of insert frag- 
ments can be controlled. 

[0033] However, introduction by crossing inevitably introduce regions proximal to RM at the same time. Japanese 
Patent Application No. 2000-247204 showed that the Rf-1 locus is mapped between DNA marker loci S1 2564 Tsp509l 
and C1361 Mwol, but the distance between both loci is about 0.3 cM, i.e. about 90 kbp. If a deleterious gene existed 
proximal to Rf-1 , it would be undeniable that the deleterious gene might be inserted together with the Rf-1 gene. 
[0034] Thus, we searched for regions linked to the RM gene between DNA marker loci S12564Tsp509l and C1361 
Mwol by chromosomal walking and genetic analysis based on the close linkage between the Rf-1 locus and the DNA 
marker locus S12564 Tsp509l. As a result, we successfully identified the region of the RM locus including the Rf-1 
gene upto about 76 kb and determined the entire base sequence of said region. According to the present invention, it 
is possible to introduce the function of a fertility restorer gene into BT male sterile cytoplasms by genetic engineering 
techniques. 

[0035] Specifically, in Japanese Patent Application No. 2000-247204, linkage analyses on a population of 1042 in- 
dividuals prepared by pollinating MS Koshihikari with MS-FR Koshihikari (heterozygous at the Rf-1 locus) revealed 
one recombinant between the Rf-1 and S1 2564 Tsp509l loci and two recombinants between the Rf-1 and C1 361 Mwol 
loci (Reference examples 1 -2 herein). In the present invention, 41 03 individuals were added to the population to analyze 
atotal of 5145 individuals. Asa result, one recombinant between the Rf-1 and S1 2564 Tsp509l loci and six recombinants 
between the Rf-1 and C1361 Mwol loci were newly found with a total of 2 and 8 recombinants. These 10 individuals 
were tested by the high-precision segregation analysis of the present invention as recombinants proximal to the RM 
locus (Example 1). 

[0036] The frequency of 8 recombinants between the Rf-1 and C1361 Mwol loci as compared with 2 recombinants 
between the Rf-1 and S1 2564 Tsp509l loci means that the S1 2564 Tsp509l locus is genetically closer to the Rf-1 locus 
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than the C1361 Mwol locus. Genetic distance (expressed in recombination frequency: cM) and physical distance (ex- 
pressed in the number of base pairs: bp) are not always proportional to each other, but it can be normally expected 
that physical distance decreases with genetic distance. 

[0037] Thus, we tried to isolate the RM locus by chromosomal walking started from the S12564 Tsp509l locus 
(Example 2). Chromosomal walking was performed on a genomic library prepared from X DASH II vector using the 
genomic DNA of an indica variety IR24 and a japonica variety Asominori. IR24 is a variety carrying Rf-1 , while Asominori 
is a variety not carrying RM . As a result of chromosomal walking, contigs covering a chromosomal region of about 76 
kb (ordered sets of overlapping clones on a chromosome) were able to be prepared from genomic clones of 1R24, and 
the entire base sequence (76363 bp) thereof was determined. 

[0038] Then, 12 markers were newly developed on the basis of the base sequence data or the like obtained and a 
high-precision segregation analysis was performed on the 10 recombinants proximal to Rf-1 locus described above 
(Example 3). As a result, a 65 kb sequence included in the chromosomal region of about 76 kb above was shown to 
contain a sequence determining the presence of the function of the Rf-1 gene. This region is covered by a contig 
consisting of 8 genomic clones. Each clone has a length of about 12-22 kb and has overlapping domains of at least 
4.7 kb. Genes for rice are known to have a wide range of lengths (from short ones to large ones), but most of them 
seem to have a length of several kbs or less. Thus, at least one of these 8 genomic clones is expected to contain the 
full-length Rf-1 gene. 

[0039] We further restricted the Rf-1 gene region in the chromosomal region of about 76 kb above and performed 
complementation assays to directly demonstrate the presence of a fertility restoring ability. 

[0040] Specifically, 10 partial fragments (each 10-21 kb) in the above region of 76 kb were separately introduced 
into immature seeds of a male sterility line MS Koshihikari by genetic engineering techniques (Fig. 5). Of the 1 0 partial 
fragments used, 8 fragments are derived from 8 genomic clones previously obtained by chromosomal walking (XSE1 , 
XSE7, XSF4, XSF20, XSG22, XSG16, XSG8 and XSH18 shown in Fig. 1 and described in Example 3). Additionally! 
fragments derived from 2 clones XSF1 8 and XSX1 were also analyzed by complementation assays. XSF1 8 is identical 
to XSF20 at the 5' and 3' ends (bases 20328 and 41921 of SEQ ID NO: 27, respectively), but lacks internal bases 
33947-38591 . This is because clone XSF18 was initially isolated but found to contain the above deletion during am- 
plification after isolation, and therefore, the amplification step was freshly taken to isolate a complete clone designated 
XSF20 (Example 8). XSX1 is a clone freshly prepared from clones XSG8 and XSH18 by restriction enzyme treatment 
and ligation to contain sufficient overlapping domains because of the overlapping domains of both clones are relatively 
small (about 7 kb) (Example 13). 

[0041] If the insert fragment completely contains the Rf-1 gene, transformed individuals at this generation restore 
fertility because Rf-1 is a dominant gene. In complementation assays plants transformed with each fragment were 
evaluated for seed fertility to find that those transformed with a 1 5.6 kb fragment (including bases 38538-541 23 of SEQ 
ID NO: 27) derived from the X phage clone XSG16 restored seed fertility (Example 10). Plants transformed with the 
other fragments were all sterile. These results showed that the above 15.6 kb fragment completely contains the Rf-1 
gene. Moreover, a method for introducing the Rf-1 gene by genetic engineering techniques was provided by the present 
invention and demonstrated to be effective. 

[0042] To further specify the region of the X phage clone XSG1 6 in which the Rf-1 gene is contained, we evaluated 
seed fertility of shorter fragments than the 15.6 kb fragment (including bases 38538-54123 of SEQ ID NO: 27) by 
complementation assays. As a result, plants transformed with a 1 1 .4 kb fragment derived from XSG1 6 (including bases 
42357-53743 of SEQ ID NO: 27) were shown to restore seed fertility (Example 10(2)). Plants transformed with a further 
shorter 6.8 kb fragment (including bases 421 32-48883 of SEQ ID NO: 27) also restored seed fertility (Example 1 0(3)). 
These results showed that the above 6.8 kb fragment contains the Rf-1 gene. 

III. Nucleic acids containing the RM locus 

[0043] The present invention provides nucleic acids containing the locus of a fertility restorer gene (RM ). The nucleic 
acids containing the locus of a fertility restorer gene (RM) of the present invention include a nucleic acid having the 
base sequence of SEQ ID N0.27, or a nucleic acid having a base sequence which is identical to at least 70% of the 
base sequence of SEQ ID NO. 27, and which functions to restore fertility. 

[0044] As described in Example 1 0, it was confirmed that the Rf-1 gene is completely contained in especially bases 
38538-541 23 of the base sequence of SEQ ID NO: 27. Accordingly, the present invention especially provides a nucleic 
acid having the base sequence of bases 38538-54123 of SEQ ID N0.27, or has a base sequence which is identical 
to at least 70% of the base sequence of bases 38538-54123 of SEQ ID N0.27, and which functions to restore fertility. 
As used herein, the term "the base sequence of SEQ ID NO: 27" refers to the entire sequence of SEQ ID NO: 27 or a 
part thereof participating in fertility restoring function, especially bases 38538-54123 according to the context. More 
preferably, it refers to bases 42357-53743, still more preferably bases 421 32-48883. 

[0045] In the examples below, a nucleic acid was isolated from a genomic library of indica rice IR24 containing the 
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Rf-1 gene as a nucleic acid containing a fertility restorer gene (RM) and determined to have the base sequence of 
SEQ ID NO: 27. However, the nucleic acid containing a fertility restorer gene (RM) of the present invention can be 
derived from any indica variety carrying the RF-1 gene. The indica varieties carrying the RM gene include, but not 
specifically limited to, e.g. IR24, IR8, IR36, IR64, Chinsurah and Boroll. Known japonica varieties not carrying the Rf- 
1 gene include, but not limited to, Asominori, Koshihikari, Kirara 397, Akihikari, Akitakomachi, Sasanishiki, Kinuhikari, 
Nipponbare, Hatsuboshi, Koganebare, Hinohikari, Mineasahi, Aichinokaori, Hatsushimo, Akebono, Fujihikari, Minenoy- 
ukimochi, Kokonoemochi, Fukuhibiki, Dontokoi, Gohyakumangoku, Hanaechizen, Todorokiwase, Haenuki, Domanna- 
ka, Yamakikari, etc. The "indica" and "japonica" varieties are well known to those skilled in the art and the rice varieties 
encompassed by the present invention can be readily determined by those skilled in the art. 
[0046] Nucleic acids of the present invention include DNA in both single-stranded and double-stranded forms, as 
well as the RNA complement thereof. DNA includes, for example, genomic DNA (including corresponding cDNA), 
chemically synthesized DNA, DNA amplified by PCR, and combinations thereof. 

[0047] Nucleic acids containing the RM gene of the present invention preferably have the base sequence of SEQ 
ID NO; 27. More than one codon may encode the same amino acid, and this is called degeneracy of the genetic code. 
Thus, a DNA sequence not completely identical to S EQ ID NO: 27 may encode a protein having an amino acid sequence 
completely identical to SEQ ID NO: 27. Such a variant DNA sequence may result from silent mutation (e.g., occurring 
during PCR amplification), or can be a product of deliberate mutagenesis of a native sequence. 
[0048] It is well known for those skilled in the art that even proteins having the same function may have different 
amino acid sequences depending on the varieties from which they are derived. The RM gene of the present invention 
includes such homologs and variants of the base sequence of SEQ ID NO: 27 so far as they functions to restore fertility. 
The expression "function to restore fertility" means that fertility is conferred on a rice individual or seed when such a 
DNA fragment is introduced. Fertility restoration may result from the expression of a protein by the Rf-1 gene or some 
function of the nucleic acid (DNA or RNA) per se of the RM gene in conferring fertility. 

[0049] Whether or not a homolog or variant of the RM gene functions to restore fertility can be examined by, but not 
limited to, the following method, for example. A nucleic acid fragment under test is introduced into immature seeds 
obtained by pollinating MS Koshihikari (sterile line) with MS-FR Koshihikari according to the method of Hiei et al. (Plant 
Journal (1 994), 6(2), p. 272-282). As the resulting transformants arecultured under normal conditions, the seeds mature 
only when the nucleic acid fragment under test functions to restore fertility. 

[0050] The nucleic acid derived from a corresponding region of japonica Asominori not carrying the RM gene has 
the base sequence shown in SEQ ID NO: 28. Corresponding parts of SEQ ID NO: 28 and SEQ ID NO: 27 have an 
overall identity of about 98%. Thus, nucleic acids containing the locus of a fertility restorer gene (Rf-1) of the present 
invention are at least about 70%, preferably about 80% or more, more preferably 90% or more, still more preferably 
95% or more, most preferably 98 or more% identical to SEQ ID NO: 27. 

[0051] The percent identity may be determined by visual inspection and mathematical calculation. Alternatively, the 
percent identity of two nucleic acid sequences can be determined by comparing sequence information using the GAP 
computer program, version 6.0 described by Devereux et al., Nucl. Acids Res., 12:387 (1984) and available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The preferred default parameters for the GAP program 
include: (1) a unary comparison matrix (containing a value of 1 for identities and 0 for non-identities) for bases, and 
the weighted comparison matrix of Gribskov and Burgess, Nucl. Acids Res., 1 4:6745 (1 986), as described by Schwartz 
and Dayhoff, eds., Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, pp. 353-358 
(1 979); (2) a penalty of 3.0 for each gap and an additional 0.1 0 penalty for each symbol in each gap; and (3) no penalty 
for end gaps. Other programs used by those skilled in the art of sequence comparison may also be used. 
[0052] Nucleic acids of the present invention also include nucleic acids which are capable of hybridizing to the base 
sequence of SEQ ID NO: 27 under conditions of moderately stringent conditions and functions to restore fertility, and 
nucleic acids which are capable of hybridizing to the base sequence of SEQ ID NO: 27 under conditions of highly 
stringent conditions and functions to restore fertility. 

[0053] As used herein, conditions of moderate stringency can be readily determined by those having ordinary skill 
in the art based on, for example, the length of the DNA. The basic conditions are set forth by Sambrook et al. Molecular 
Cloning: A Laboratory Manual, 2nd. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory Press, (1989), and include 
use of a prewashing solution for the nitrocellulose filters 5 x SSC, 0.5% SDS, 1 .0 mM EDTA (pH 8.0), hybridization 
conditions of about 1 x SSC to 6 x SSC at about 40°C to 60°C (or other similar hybridization solution, such as Stark's 
solution, in about 50% formamide at about 42°C), and washing conditions of about 60°C, 0.5 x SSC, 0.1% SDS. The 
hybridization temperature is about 15-20°C lower when the hybridization solution contains about 50% formamide. 
Conditions of high stringency can also be readily determined by the skilled artisan based on, for example, the length 
of the DNA. Generally, conditions of high stringency include hybridization and/or washing conditions at higher temper- 
atures and/or lower salt concentrations than in the conditions of moderate stringency described above. For example, 
such conditions include hybridization conditions of 0.1 x SSC to 0.2 x SSC at about 60-65°C and/or washing conditions 
of 0.2 x SSC, 0.1% SDS at about 65-68°C. The skilled artisan will recognize that the temperature and wash solution 
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salt concentration can be adjusted as necessary according to factors such as the length of the probe. 
[0054] DNAs of the present Invention also Include nucleic acids that differ from the base sequence of SEQ ID NO: 
27 due to deletions, insertions or substitutions of one or more bases while retaining a fertility restoring function. So far 
as a fertility restoring function is retained, the number of bases to be deleted, inserted or substituted is not specifically 
limited, but preferably 1 to several thousands, more preferably 1 -1 000, still more preferably 1 -500, even more preferably 
1 -200, most preferably 1 -1 00. 

[0055] Once the Rf-1 gene is further specified on the basis of the descriptions herein, it can be used by those skilled 
in the art after nucleic acids such as other regions than the Rf-1 gene or intron regions in the Rf-1 gene are removed. 
A given amino acid may be replaced, for example, by a residue having similar physiochemical characteristics. Examples 
of such conservative substitutions include changes from one aliphatic residue to another, such as changes from one 
to another of lie, Val, Leu, or Ala; changes from one polar residue to another, such as changes between Lys and Arg, 
Glu and Asp, or Gin and Asn; or changes from one aromatic residue to another, such as changes from one to another 
of Phe, Trp, or Tyr. Other well-known conservative substitutions include e.g. changes between entire regions having 
similar hydrophobic characteristics. Those skilled in the art can introduce desired deletions, insertions or substitutions 
by well-known gene engineering techniques using e.g. site-specific mutagenesis as described in Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory Press, (1 989). 
[0056] We compared an indica variety IR24 carrying the Rf-1 gene (SEQ ID NO: 27) with japonica varieties not 
carrying it such as Asominori (SEQ ID NO: 28) and a Nipponbare BAC clone deposited with GenBank (Accession No. 
AC068923). As a result, we found that the RM region of the indica variety containing the Rf-1 gene has at least the 
following single bases polymorphisms (SNP). 



1) a base corresponding to the base 1239 of SEQ ID NO: 27 is A; 

2) a base corresponding to the base 6227 of SEQ ID NO: 27 is A; 

3) a base corresponding to the base 20680 of SEQ ID NO: 27 is G; 

4) a base corresponding to the base 45461 of SEQ ID NO: 27 is A; 

5) a base corresponding to the base 49609 of SEQ ID NO: 27 is A; 

6) a base corresponding to the base 56368 of SEQ ID NO: 27 is T; 

7) a base corresponding to the base 57629 of SEQ ID NO: 27 is C; and 

8) a base corresponding to the base 66267 of SEQ ID NO: 27 is G. 



[0057] Thus, nucleic acids containing the RM region of the present invention preferably meet one to all of the re- 
quirements 1 )-8) above. 

[0058] In Example 3 below, the chromosomal organizations of recombinants proximal to the Rf-1 gene (RS1-RS2, 
RC1 -RC8) were tested in the RM region. The results showed that a sequence determining the presence of the function 
of the Rf-1 gene is contained in the base sequence of bases 1239-66267 of SEQ ID NO: 27, i.e. in a region from the 
P4497 Mbol to B56691 Xbal loci (about 65 kb) as estimated at maximum (Fig. 3). However, there is a possibility that 
it is important for the expression of the genetic function of the RM gene that the RM gene is partially of the indica 
genotype, and that the genetic function may not be significantly changed whether the remaining regions are of the 
japonica or indica genotype. There may be an extreme case that the coding region is completely identical and only the 
promoter region is different between japonica and indica, and that the promoter region and the coding region are only 
partially included in the region from P4497 the Mbol to B56691 Xbal loci (about 65 kb). Therefore, it cannot be concluded 
that the common indica region above (bases 1239-66267 of SEQ ID NO: 27) completely contains the entire Rf-1 gene. 
However, it is thought that at least SEQ ID NO: 27 completely contains the entire Rf-1 gene for the following reasons! 

1) the size of a gene is normally several kilobases, and rarely exceeds 10 kb; 

2) the genomic base sequence of IR24 determined by the present invention (SEQ ID NO: 27) completely contains 
the common indica region above; 

3) the 5' end of SEQ ID NO: 27 is located 1238 bp upstream of the 5' end of the common indica region above and 
forms a part of another gene (S 12564); and 

4) the 3' end of SEQ ID NO: 27 is located 1 0096 bp downstream of the 3' end of the common indica region above. 

[0059] In this way, we first succeeded in restricting the region of the RM gene to 76 kb. Thus, nucleic acids containing 
the region of the Rf-1 gene of the present invention are extremely less likely to contain other genes proximal to the Rf- 
1 gene as compared with those selected with the co-dominant marker locus at a genetic distance of about 1 cM (about 
300 kb) from the RM gene described in a prior documents such as Japanese Patent Public Disclosure No 
2000-139465. Moreover, they are less likely to contain other genes than those selected with the DNA marker loci 
S12564Tsp509l and C1361 Mwol (at a distance of about 0.3 cM between them) described in our prior Japanese Patent 
Application No. 2000-247204. 
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[0060] We further confirmed by complementation assays that the Rf-1 gene is completely contained in especially 
bases 38538-54123 of the base sequence of SEQ ID NO: 27. In an embodiment of the present invention, therefore 
the base sequence at least 70% identical to the base sequence of SEQ ID NO: 27 or to the base sequence of bases' 
38538-54123 of SEQ ID NO: 27 meets at least one of the following requirements 1) and 2): 

1) a base corresponding to the base 45461 of SEQ ID NO: 27 is A; 

2) a base corresponding to the base 49609 of SEQ ID NO: 27 is A. 



IV. Method for restoring rice fertility 



[0061] The present invention provides a method for restoring rice fertility comprising introducing a nucleic acid into 
rice, wherein the nucleic acid has the base sequence of SEQ ID NO.27, or has a base sequence which is identical to 
at least 70% of the base sequence of SEQ ID NO.27, and which functions to restore fertility. The methods of the present 
invention may comprise introducing a nucleic acid into rice, wherein the nuclei acid has a portion of SEQ ID NO 27 
especially bases 38538-54123, preferably bases 42357-53743, more preferably bases 42132-48883 of SEQ ID NO: 
27 or has a base sequence which is identical to at least 70% identical of the base sequence of bases 38538-54123 
preferably bases 42357-53743, more preferably bases 421 32-48883 of SEQ ID NO: 27 and, which functions to restore 
fertility. 

[0062] In the present invention, the nucleic acid containing the locus of a fertility restorer gene (Rf-1) that can be 
introduced into rice can be any one of the nucleic acids described above in "III. Nucleic acids containingthe Rf-1 locus". 
The method for introducing the nucleic acid into rice is not specifically limited but can be any known method. Nucleic 
acids of the present invention can be introduced by known genetic engineering techniques or crossing. Genetic engi- 
neering techniques are preferably used because inclusion of other neighboring genes can be prevented and the period 
for establishing a line can be shortened. 

[0063] Any suitable expression system for transduction by genetic engineering techniques can be employed. Re- 
combinant expression vectors comprise a nucleic acid containing a fertility restorer gene (Rf-1) of the invention that 
can be introduced into rice, operably linked to suitable transcriptional or translational regulatory base sequences, such 
as those derived from a mammalian, microbial, viral, or insect gene. 

[0064] Examples of regulatory sequences include transcriptional promoters, operators, or enhancers, an mRNA ri- 
bosomal binding site, and appropriate sequences which control transcription and translation initiation and termination. 
Base sequences are operably linked to a regulatory sequence when the regulatory sequence is functionally associated 
with the DNA sequences. Thus, a promoter base sequence is operably linked to a DNA sequence if the promoter base 
sequence controls the transcription of the DNA sequence. An origin of replication that confers the ability to replicate 
in rice, and a selection gene by which transformants are identified, are generally incorporated into expression vectors 
As for selectable markers, those commonly used can be used by standard methods. Examples are genes resistant to 
antibiotics such as tetracycline, ampicillin, kanamycin, neomycin, hygromycin or spectinomycin. 
[0065] In addition, a sequence encoding an appropriate signal peptide (native or heterologous) can be incorporated 
into expression vectors. A DNA sequence for a signal peptide (secretory leader) may be fused in frame to a nucleic 
acid sequence of the invention so that the DNA is initially transcribed, and the mRNA translated into a fusion protein 
containing the signal peptide. 

[0066] The present invention also provides recombinant vectors containing a gene of the present invention. Methods 
for integrating a DNA fragment of a gene of the present invention into a vector such as a plasmid are described in e 
g. Sambrook, J. et al, Molecular Cloning, A Laboratory Manual (2nd edition), Cold Spring Harbor Laboratory 1 53 
(1989). Commercially available ligation kits (e.g. available from TAKARA) can be conveniently used. Thus obtained 
recombinant vectors (e.g. recombinant plasmids) are transferred into host rice cells. 

[0067] Vectors can be conveniently prepared by linking a desired gene to a recombinant vector available in the art 
(e.g. plasmid DNA) by standard methods. Plant transforming vectors are especially useful for conferring fertility on rice 
using a nucleic acid fragment of the present invention. Vectors for plants are not specifically limited so far as they can 
express the gene of interest in plant cells to produce the protein, but preferably include pBI221, pBI121 (Clontech) 
and vectors derived therefrom. Especially, examples of vectors for transforming rice belonging' to monocotyledons 
include plG121Hm and pTOK233 (Hiei et al., Plant J„ 6, 271-282 (1994)), and pSB424 (Komari et al Plant J 10 
165-174(1996)). ' 
[0068] Transgenic plants can be prepared by replacing the p-glucuronidase (GUS) gene in the above vectors with 
a nucleic acid fragment of the present invention to construct a plant transforming vector and transfecting it into a plant 
The plant transforming vector preferably comprises at least a promoter, a start codon, a desired gene (a nucleic acid 
sequence of the present invention or a part thereof), a stop codon and a terminator. It may also contain a DNA encoding 
a signal peptide, an enhancer sequence, non-translated 5 1 and 3' regions of the desired gene, a selectable marker 
region, etc., as appropnate. Promoters and terminators are not specifically limited so far as they are functional in plant 
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cells, among which constitutive expression promoters include the 35S promoter initially contained in the above vectors 
as well as promoters for actin and ubiquitin genes. 

[0069] Suitable methods for introducing a plasmid into a host cell include the use of calcium phosphate or calcium 
chloride/rubidium chloride, electroporation, electroinjection, chemical treatment with PEG or the like, the use of a gene 
gun described in Sambrook, J. et al., Molecular Cloning, A Laboratory Manual (2nd edition), Cold Spring Harbor Lab- 
oratory, 1 .74(1 989). Plant cells can be transformed by e.g. the leaf disc method [Science, 227, 129 (1 985)] or electro- 
poration [Nature, 31 9, 791 (1 986)). 

[0070] Methods for transferring a gene into a plant include the use of Agrobacterium (Horsch et al., Science, 227,129 
(1985); Hiei et al., Plant J., 6, 271-282(1994)), electroporation (Fromm et al., Nature, 319, 791(1986)), PEG (Pasz- 
kowski et al., EMBO J., 3, 2717(1984)), microinjection (Crossway et al., Mol. Gen. Genet., 202, 179 (1986)), particle 
bombardment (McCabe et al., Bio/Technology, 6, 923(1988)). Methods are not specifically limited so far as they are 
suitable for transfecting a nucleic acid into a desired plant. 

[0071] Transduction by crossing can be performed as follows, for example. First, F 1 obtained by crossing an Rf-1 
donor parent and a japonica variety is backcrossed with the japonica variety. The resulting individuals are screened 
for those homozygous for japonica at the S12564 Tsp509l locus and heterozygous at the P4497 Mbol and B53627 
BstZ1 71 loci and further backcrossed. The resulting individuals are screened for those heterozygous at the P4497 Mbol 
and B56691 Xbal loci and homozygous for japonica at the B53627 BstZ17l locus and further backcrossed. Subse- 
quently, about 10 cycles of screening each backcrossed generation for individuals heterozygous at the P4497 Mbol 
and B56691 Xbal loci and subjecting them to the subsequent backcrossing are repeated. Finally, individuals hetero- 
zygous at the P4497 Mbol and B56691 Xbal loci are self-fertilized and the resulting individuals are screened for those 
homozygous for indica at both loci, whereby a restorer line inheriting a limited chromosomal region from the P4497 
Mbol to B56691 Xbal loci from the Rf-1 donor parent can be obtained. 

[0072] According to the present invention, nucleic acids containing a fertility restorer gene (Rf-1) were isolated, 
whereby the Rf-1 gene can be introduced into a rice variety using genetic engineering tech niques to establish a restorer 
line. The present invention succeeded in restricting the Rf-1 region to 76 kb or less. Therefore, nucleic acids containing 
the RM locus of the present invention are extremely less likely to contain other genes neighboring the Rf-1 gene than 
those of the prior art. Moreover, the entire base sequence of the region containing the Rf-1 gene was determined by 
the present invention. Those skilled in the art can proceed with analysis of the Rf-1 gene itself on the basis of the 
description herein. Thus, only the Rf-1 gene can be introduced without including any neighboring gene. This is espe- 
cially important when neighboring genes bring deleterious traits. Furthermore, restorer lines can be established in a 
shorter period such as 1 -2 years than obtained by crossing. 

[0073] In complementation assays described in Examples 4-13 herein, MS Koshihikari (having BT cytoplasm and a 
core gene substantially identical to Koshihikari) was actually transformed by an Agrobacterium-mediated method using 
fragments from 1 0 clones described in Fig. 5. The results demonstrated that fertility restorer lines are established from 
a nucleic acid containing the base sequence of bases 38538-54123, preferably bases 42357-53743, more preferably 
bases 421 32-48883 of SEQ ID NO: 27. 

[0074] Agrobacterium-mediated methods for establishing rice restorer lines are described in, but not limited to, Hiei 
et al., Plant J. ,6, pp. 271-282(1994), Komarietal., Plant J. ,10, p.1 65-1 74(1 996), Ditta etal., Proc. Natl. Acad. Sci USA 
77: pp. 7347-7351(1980), etc. 

[0075] First, a plasmid vector containing a nucleic acid fragment of interest to be inserted is prepared. Suitable 
plasmid vectors include e.g. pSB11 , pSB22 and the like having a plasmid map described in Komari et al., Plant J., 10, 
pp. 1 65-1 74 (1 996), supra. Alternatively, those skilled in the art can also construct an appropriate vector by themselves 
on the basis of plasmid vectors such as pSB1 1 , pSB22 described above. In the examples herein below, an intermediate 
vector pSB200 having a hygromycin-resistant gene cassette was prepared on the basis of pSB11 , and used. Specifi- 
cally, a nopaline synthase terminator (Tnos) was first fused to a ubiquitin promoter and a ubiquitin intron (Pubi-ubil). 
A hygromycin-resistant gene (HYG(R)) was inserted between ubil and Tnos of the resulting Pubi-ubil-Tnos complex 
to give a Pubi-ubil-HYG(R)-Tnos assembly. This assembly was fused to a Hindlll/EcoRI fragment of pSB11 (Komari 
et al., supra.) to give pKY205. Linker sequences for adding restriction enzyme sites Notl, NspV, EcoRV, Kpnl, Sad, 
Eco Rl were inserted into the Hind I II site upstream of Pubi of this pKY205 to give pSB200 having a hygromycin-resistant 
gene cassette. 

[0076] Then , E. coli cells (e.g. DH5a, JM1 09, MV1 1 84, all commercially available from e.g. TAKARA) are transformed 
with the recombinant vector containing the nucleic acid inserted. 

[0077] Thus transformed E. coli cells are used for triparental mating with an Agrobacterium strain preferably in com- 
bination with a helper E. coli strain according to e.g. the method of Ditta et al. (1980). Suitable Agrobacterium strains 
include Agrobacterium tumefaciens strains such as LBA4404/pSB1 , LBA4404/pNB1, LBA4404/pSB3, etc. They all 
have a plasmid map described in Komari et al., Plant J., 10, pp. 165-174 (1996), supra, and can be used by those 
skilled in the art by constructing a vector by themselves. Suitable helper E. coli strains include, but not limited to, e.g. 
HB101/pRK2013 (available from Clontech). A report shows that E. coli cells carrying pRK2073 can also be used as 
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helper E. coli though they are less common (Lemas et al., Plasmid 1992, 27, pp. 161-163). 

[0078] Then, the Agrobacterium cells mated as intended are transformed Into male sterility rice according to e.g. the 
method of Hiei et al (1 994). Necessary immature rice seeds for transformation can be prepared by e.g. pollinating male 
sterility rice with a japonica variety. 

[0079] Fertility restoration in transformed plants can be assessed by e.g. evaluating seed fertility in standing plants 
about one month after heading. Evaluation on standing plants means observation of plants grown in a field or the like. 
An alternative method is a laboratory study of grain ripening percentages in the ear. 

V. Methods for discerning the presence of the Rf-1 gene 

[0080] According to the present invention, it was shown that a sequence determining the presence of the function 
of the Rf-1 gene is located between the polymorphism-detecting marker loci P4497 Mbol and B56691 Xbal on rice 
chromosome 1 0. Moreover, complementation assays confirmed that the Rf-1 gene is completely contained in especially 
bases 38538-54123 of the base sequence of SEQ ID NO: 27. 

[0081] Comparison of the base sequence of an indica variety carrying the Rf-1 gene (IR24) (SEQ ID NO: 27) with 
those of japonica varieties not carrying said gene (Asominori (SEQ ID NO: 28) and Nipponbare BAC clone AC068923) 
revealed the presence of polymorphisms between both varieties. As a result, it became possible to conveniently, rapidly 
and exactly discern whether or not a rice plant or seed under test carries the Rf-1 gene on the basis of polymorphisms 
in base sequence in regions neighboring the Rf-1 gene. 

[0082] Therefore, the present invention also provides a method for discerning whether or not a subject rice individual 
or a seed thereof has the Rf-1 gene or not, wherein the method utilizing a fact that a sequence determining the presence 
of the function of the Rf-1 gene positions between the polymorphism detection marker loci P4497 Mbol and B56691 
Xba I on rice chromosome 10. 

[0083] Polymorphisms can be detected by any known method. For example, known methods include assays for 
restriction fragment length polymorphisms (RFLPs); direct determination by sequencing; cutting a genomic DNA with 
a 8-base recognizing restriction enzyme, and then radioactivelly labeling the ends and further cutting the labeled digest 
with 6-base and 4-bases recognizing restriction enzyme and then developing the digest by two-dimensional electro- 
phoresis (RLGS, Restriction Landmark Genome Scanning); etc. AFLP analysis (amplified fragment length polymor- 
phism; P. Vos et al., Nucleic Acids Res. Vol. 23, pp. 4407-4414 (1995)) has also been developed wherein RFLP is 
amplified/detected by polymerase chain reaction (PCR). 

[0084] For example, conventional methods involved detecting RFLPs via PCR amplification (conversion of RFLP 
markers into PCR markers) or detecting polymorphisms in microsatellites via PCR amplification (microsatellite markers) 
as illustrated below. 

Conversion of RFLP markers into PCR markers 
[0085] 

A. PCR markers based on polymorphisms in genomic regions corresponding to RFLP probes (D.E. Harry, B. 
Temesgen, D.B. Neale; Codominant PCR-based markers for Pinus taeda developed from mapped cDNA clones, 
Theor. Appl. Genet. (1998) 97: pp. 327-336). After performing genomic PCR using primers designed for an RFLP 
marker probe sequence ("RFLP" is a polymorphism observed by Southern analysis using a DNA fragment as a 
probe. The base sequence of the DNA fragment used as a probe is called "RFLP marker probe sequence"), a 
PCR marker can be prepared by either of the following two procedures. A first procedure involves treating the 
products with a series of restriction enzymes to search for a restriction enzyme causing a fragment length poly- 
morphism, and a second procedure involves searching for a polymorphism by varietal comparison of the base 
sequences of the products and preparing a PCR marker based on the polymorphism. 

B. PCR markers based on identification of RFLP-causing sites. A PCR marker can be obtained by identifying an 
RFLP-causing site (a restriction enzyme recognition site carried by only one of two varieties compared) present 
in or near (normally within several kbs) an RFLP marker probe sequence. 

Microsatellite markers 

[0086] Microsatellites are repeat sequences of about 2 to 4 bases such as (CA) n that are present in great numbers 
in genomes. If a varietal polymorphism occurs in repetition number, a polymorphism can be observed in PCR product 
length by PCR using primers designed in adjacent regions, whereby the DNA polymorphism can be detected. Markers 
for detecting polymorphisms using microsatellites are called microsatellite markers (O. Parnaud, X. Chen, S.R Mc- 
Couch, Mol. Gen. Genet. (1996) 252: pp. 597-607). 
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[0087] Methods for detecting polymorphisms in the present invention are not specificaily limited. From the viewpoint 
of efficiency and convenience, PCR-RFLP is preferred, which is a combination of PCR and RFLP to identify polymor- 
phisms from their restriction enzyme cleavage patterns in cases where they exist among variety lines at restriction 
enzyme recognition sites in the sequences of DNA fragments amplified by PCR. PCR-RFLP is also called CAPS 
(cleaved amplified polymorphic sequence). If any suitable restriction enzyme recognition site is not present in a region 
showing polymorphisms, a modified CAPS called dCAPS (derived cleaved amplified polymorphic sequence) can also 
be used wherein restriction enzyme sites are introduced during PCR (Michaels, S.D. and Amasino, R.M. (1998), The 
Plant Journal 14(3) 381-385; A. Konieczny et al.,(1993), Plant J.4(2) pp. 403-410; Neff, M.M., Neff, J.D., Chory, J. and 
Pepper, A.E. (1998), The Plant Journal 14(3) 387-392). These methods are explained in more detail below. 

CAPS, dCAPS 

[0088] The method for discerning of the present invention comprise, but not limited to: 

i) preparing a pair of primers based on the base sequences of a site showing a polymorphism in the base sequences 
between indica and japonica varieties at the Rf-1 locus and its adjacent regions to amplify said base sequences; 

ii) performing nucleic acid amplification reaction(s) using the genomic DNA of the subject rice individual or the 
seed thereof as a template; and 

iii) discerning whether or not the subject rice individual or the seed thereof has the Rf-1 gene based on the poly- 
morphism found in the nucleic acid amplification product. 

[0089] The step of preparing a primer pair in i) preferably comprises any of the following means: 

a) when a change containing a deleted region exists in the polymorphism in the nucleic acid amplification product, 
preparing a pair of primers for nucleic acid amplification to flank the deleted region to form a marker for detecting 
the polymorphism; 

b) when a base change causing a difference in restriction enzyme recognition exists in the polymorphism in the 
nucleic acid amplification product, preparing a pair of primers for nucleic acid amplification to flank the base change 
site to form a marker for detecting the polymorphism; or 

c) when a base change causing no difference in restriction enzyme recognition exists in the polymorphism in the 
nucleic acid amplification product, preparing a pair of primers for introducing a mismatch, wherein pair of primers 
contain the base change site and alters a region containing the base change site into a base sequence causing 
a difference in restriction enzyme recognition in the nucleic acid amplification product to form a marker for detecting 
the polymorphism. 

[0090] Suitable polymorphic sites for discerning the presence of the Rf-1 gene in the present invention can be ap- 
propriately selected so that a polymorphism detecting marker can be prepared as described below on the basis of 
comparison of, but not limited to, the base sequence of an indica variety carrying the Rf-1 gene (IR24) (SEQ ID NO: 
27) with those of japonica varieties not carrying said gene (Asominori (SEQ ID NO: 28) and Nipponbare BAC clone 
AC068923). 

[0091 ] If the polymorphism found causes a difference in restriction enzyme recognition, for example, a pair of primers 
for nucleic acid amplification are prepared to flank the polymorphic site and used for detecting the polymorphism. 
Primers are preferably designed not to be specific for highly repeated sequences to avoid undesired products. If the 
polymorphism found does not cause a difference in restriction enzyme recognition, markers can be prepared by ap- 
plying the dCAPS method described above. Primers for dCAPS markers are preferably designed not to be specific for 
repeat sequences and to provide a product length of preferably 50-300 bases, more preferably about 100 bases to 
ease identification of polymorphisms. 

[0092] If the polymorphism found involves a microsatellite, nucleic acid amplification primers are prepared to flank 
the microsatellite and used to detect the polymorphism. Again, the primers are preferably designed not to be specific 
for repeat sequences. 

1) Nucleic acid amplification 

[0093] In the present invention, a pair of primers are preferably prepared for amplifying adjacent regions containing 
polymorphisms on the basis of the determined base sequence of the nucleic acid of a subject rice individual or seed 
at the Rf-1 locus. The primer pair is used to perform a nucleic acid amplification reaction with the genomic DNA of the 
subject rice individual or seed as a template. The nucleic acid amplification reaction is preferably polymerase chain 
reaction (PCR) (Saiki et al., 1985, Science 230, pp. 1350-1354). 
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[0094] The pair of primers for nucleic acid amplification can be prepared by any known method on the basis of the 
base sequence of a poiymorphic site and adjacent regions thereto. Specifically, a primer pair can be prepared on the 
basis of the base sequence of a polymorphic site and adjacent regions thereto by a process comprising generating a 
single-stranded DNA having the same base sequence as the base sequence of the polymorphic site and adjacent 
regions thereto or a base sequence complementary to said regions or, if necessary, generating the single-stranded 
DNA containing a modification without affecting the binding specificity to the base sequence of the polymorphic site 
and adjacent regions thereto provided that the following conditions are satisfied: 

1) the length of each primer should be 15-30 bases; 

2) the proportion of G+C in the base sequence of each primer should be 30-70%; 

3) the distribution of A, T, G and C in the base sequence of each primer should not be partially largely uneven; 

4) the length of the nucleic acid amplification product amplified by the primer pair should be 50-3000 bases, pref- 
erably 50-300 bases; and 

5) any complementary sequence segment should not occur with the base sequence of each primer or between 
the base sequences of the primers. 

[0095] As used herein, the "adjacent regions" to a polymorphic site mean that an area containing both of a polymor- 
phic site and adjacent regions thereto is within a distance suitable for nucleic acid amplification, preferably PCR. The 
adjacent regions amplified preferably have a length within the range of, but not limited to, about 50 bases to about 
3000 bases, more preferably about 50 bases to about 2000 bases. To facilitate identification of polymorphisms, the 
product length is preferably 50-300 bases, more preferably about 100 bases. The adjacent regions preferably have a 
length within the range of, but not limited to, about 0 to about 3000 bases, more preferably about 0 to about 2000 
bases, still more preferably about 0 to about 1000 bases on the 5' or 3' side of a polymorphic site. 
[0096] Procedures and conditions for the nucleic acid amplification reaction are not specifically limited and are well 
known to those skilled in the art. Appropriate conditions can be applied by those skilled in the art depending on various 
factors such as the base sequence of the polymorphic site and adjacent regions thereto, the base sequence and length 
of the primer pair, etc. Generally, the nucleic acid amplification reaction can be performed under more stringent con- 
ditions (annealing reaction and nucleic acid elongation reaction at higher temperatures and less cycles) as the primer 
pair is longer or the proportion of G+C is higher or the distribution of A, T, G and C is evener. The use of more stringent 
conditions allows an amplification reaction with higher specificity. 

[0097] The amplification reaction can be performed under conditions of, but not limited to, one cycle of 94°C for 2 
min, 30 cycles of 94°C for 1 min, 58 °C for 1 min and 72°C for 2 min, and finally one cycle of 72°C for 2 min using 50 
ng of a genomic DNA as a template, 200 uJvl of each dNTP and 5 U of ExTaq™ (TAKARA). The reaction can also be 
performed under conditions of one cycle of 94°C for 2 min, 30 cycles of 94°C for 1 min, 58°C for 1 min and 72°C for 
1 min, and finally one cycle of 72°C for 2 min. In another embodiment, the reaction can also be performed under 
conditions of one cycle of 94°C for 2 min, 35 cycles of 94°C for 30 sec, 58°C for 30 sec and 72°C for 30 sec, and finally 
one cycle of 72°C for 2 min. S 
[0098] The subject rice (test rice) genomic DNA used as a template for PCR can be easily extracted from individuals 
or seeds by the method of Edwards et al. (Nucleic Acids Res. 8(6):1349, 1991). More preferably, DNA purified by 
standard techniques is used. An especially preferred extraction method is the CTAB method (Murray, M.G. et al., 
Nucleic Acids Res. 8(1 9):4321 -5, 1 980). The DNA is preferably used as a template for PCR at a final concentration of 
0.5 ng/ul. 

2) Preparation of markers for detecting polymorphisms 

[0099] After examining whether or not a polymorphism is detected in the amplification product by the nucleic acid 
amplification reaction with a pair of primers, a marker for detecting the polymorphism is prepared on the basis of the 
polymorphism found. Non-limiting examples of polymorphisms that can be detected in the amplification product are 
as follows. 

a) A change containing a deleted region exists in the polymorphism in the nucleic acid amplification product. 

[0100] In this case, a pair of primers for nucleic acid amplification are prepared to flank the deleted region to form a 
marker for detecting the polymorphism. If the deleted region has a sufficient size, the polymorphism can be detected 
from the difference in mobility by electrophoresing the amplification product on an agarose gel or an acrylamide gel, 
for example. The polymorphism can be detected when the difference in base pair numbers is about 5% or more in the 
case of agarose gel electrophoresis or when the difference in length is about 1 base or more in the case of sequencing 
acrylamide gel electrophoresis, for example, Alternatively, the polymorphism can be detected by hybridizing the nucleic 
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acid amplification product using an oligobase or a DNA fragment having a complementary sequence to the base se- 
quence excluding the deleted region as an analytical probe. Alternatively, the polymorphism can be confirmed by 
determining the base sequence of the amplification product, if desired. Known techniques for electrophoresis of nucleic 
acids, hybridization, sequencing and the like can be used as appropriate by those skilled in the art. In this case, the 
difference in the length of the amplification product directly reflects the polymorphism and markers for detecting poly- 
morphisms on this basis are called ALP (amplicon length polymorphism) markers. 

b) A base change causing a difference in restriction enzyme recognition exists in the polymorphism in the nucleic acid 
amplification product. 

[01 01 ] In this case, a pair of primers for nucleic acid amplification are prepared to flank the base change site to form 
a marker for detecting the polymorphism. In this case, a base change causing a difference in restriction enzyme rec- 
ognition occurs in the polymorphism of the nucleic acid amplification product, i.e. the nucleic acid amplification product 
may be cleaved or not with one or more specific restriction enzymes. Thus, the amplification product can be treated 
with the restriction enzymes and electrophoresed on e.g. an agarose gel to detect the polymorphism from the difference 
in mobility. The polymorphism can be confirmed by determining the base sequence of the amplification product, if 
desired. 

[0102] In this case, the difference in the length of the restriction fragment of the amplification product by PCR or the 
like reflects the polymorphism and markers for detecting polymorphisms on this basis are called CAPS markers or 
20 pcR-RFLP markers (A. Konieczny et al., supra.) 

[0103] This is exemplified by primer pairs P4497 Mbol, P23945 Mbol, P41030Taql, P451 77 BstUI, B59066 BsaJI 
and B56691 Xbal in Example 1 below. Even if the polymorphism can be detected by the length of the nucleic acid 
amplification product as described in a) above, the polymorphism can be more easily detected by combination with 
restriction enzyme treatment. 
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c) A base change causing no difference in restriction enzyme recognition exists in the polymorphism in the nucleic 
acid amplification product. 

[0104] In this case, a pair of primers for introducing a mismatch are prepared that contains the base change site and 
alters a region containing the base change site into a base sequence causing a difference in restriction enzyme rec- 
ognition in the nucleic acid amplification product to form a marker for detecting the polymorphism. 
[0105] Specifically, a pair of primers based on the base sequences of regions naturally proximal to the Rf-1 gene 
cause a polymorphism in the nucleic acid amplification product but no difference in restriction enzyme recognition, and 
therefore, a mismatch is introduced into one or both of the primers to alter a region containing the base change site 
(polymorphism) into a base sequence causing a difference in restriction enzyme recognition in the nucleic acid ampli- 
fication product. For example, the method described in Mikaelian et al., Nucl. Acids. Res. 20:376.1992 can be used 
as a standard technique for substituting, deleting or adding a specific base by PCR-mediated site-specific mutagenesis. 
The amplification product using the mismatch-introducing primers as a marker for detecting the polymorphism may be 
cleaved or not with one or more specific restriction enzymes because it has a difference in restriction enzyme recognition 
at the mismatch-introducing site. Therefore, the amplification product can be treated with the restriction enzymes and 
electrophoresed on e.g. an agarose gel to detect the polymorphism from the difference in mobility, as described in b) 
above. 

[0106] The introduction of a mismatch must not affect not only the binding of the primers to a target plant genome 
but also the polymorphic base change. The polymorphic base change is used to introduce a mismatch near it so that 
a difference in restriction enzyme recognition occurs by a combination of both base change and mismatch. Methods 
for introducing such a mismatch are known to those skilled in the art and described in detail in Michaels, S.D. and 
Amasino, R.M. (1998), Neff, M.M., Neff, J.D., Chory, J. and Pepper, A.E. (1998), for example. 
[0107] Markers in this case are improved CAPS markers described in b) above and called dCAPS (derived CAPS) 
markers. This is exemplified by P9493 Bsll in Example 3 below. 

[0108] If there are many extra restriction sites unrelated to varietal polymorphisms in the case of b) or c) above, it 
may be difficult to discern any difference in restriction site recognition based on polymorphisms. In this case, a mismatch 
may be introduced into a primer as appropriate to abolish unnecessary restriction sites. For example, a mismatch was 
introduced into the R-primer to abolish the Mspl site unrelated to polymorphisms in B60304 Mspl inExample 3 
[0109] Although the invention is not limited to any specific method, CAPS or dCAPS methods have several advan- 
tages over other RFLP methods. Specifically, analyses can be made with smaller amounts of samples than in RFLP 
for example. Another advantage is that the time and labor required for analyses can be greatly reduced. Polymorphisms 
detected with PCR markers can be visualized by agarose gel electrophoresis that is easier than acrylamide gel elec- 
trophoresis used for microsatellite markers. * 
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Preferred embodiments of the discerning method of the present invention 

[0110] Preferred embodiments of the method for discerning whether or not a subject rice has the RM gene are 
described below for illustrative purposes. In the examples herein, it was found that the base sequence of an indica 
variety IR24 carrying the RM gene (SEQ ID NO: 27) has at least the following polymorphisms 1 )-8) as compared with 
corresponding regions of japonica varieties: 

1) a base corresponding to the base 1239 of SEQ ID NO: 27 is A; 

2) a base corresponding to the base 6227 of SEQ ID NO: 27 is A; 

3) a base corresponding to the base 20680 of SEQ ID NO: 27 is G; 

4) a base corresponding to the base 45461 of SEQ ID NO: 27 is A; 

5) a base corresponding to the base 49609 of SEQ ID NO: 27 is A; 

6) a base corresponding to the base 56368 of SEQ ID NO: 27 is T; 

7) a base corresponding to the base 57629 of SEQ ID NO: 27 is C; and 

8) a base corresponding to the base 66267 of SEQ ID NO: 27 is G. 

[0111 J In preferred embodiments of the present invention, therefore, the subject rice individual or seed is judged as 
carrying the RM gene when one to all of the requirements 1 )-8) above are met. 

[0112] We further verified that a region essential for the expression of the function of the RM gene is contained in 
especially bases 38538-54123, preferably bases 42357-53743, more preferably bases 42132-48883 in the base se- 
quence of SEQ ID NO: 27. In an embodiment of the present invention, therefore, the subject rice individual or seed is 
determined to have the RM gene in the case that the nucleic acid having a base sequence which is identical to at 
least 70% of the base sequence of SEQ ID NO.27 or of the base sequence of bases 38538-54123 of SEQ ID N0.27, 
meets at least one of the following requirements 1 ) and 2): 

1) a base corresponding to the base 45461 of SEQ ID NO.27 is A; and 

2) a base corresponding to the base 49609 of SEQ ID NO.27 is A. 

[0113] Known polymorphism detecting methods can be used to determine whether or not the above requirements 
are met. The base sequence of adjacent regions containing said sequence can also be directly determined. However, 
CAPS or dCAPS methods described above are preferably used because they are rapid and convenient. CAPS or 
dCAPS methods can be performed by a protocol comprising, for example: 

i) preparing a pair of primers based on a base sequence of adjacent regions including any one of the following base; 

1) a base corresponding to the base 1239 of SEQ ID NO: 27; 

2) a base corresponding to the base 6227 of SEQ ID NO: 27; 

3) a base corresponding to the base 20680 of SEQ ID NO: 27; 

4) a base corresponding to the base 45461 of SEQ ID NO: 27; 

5) a base corresponding to the base 49609 of SEQ ID NO: 27; 

6) a base corresponding to the base 56368 of SEQ ID NO: 27; 

7) a base corresponding to the base 57629 of SEQ ID NO: 27; and 

8) a base corresponding to the base 66267 of SEQ ID NO: 27 is G. 

to amplify both the base of the above and adjacent regions thereto; 

ii) performing nucleic acid amplification reaction(s) using the genome DNA of the subject rice individual or the seed 
thereof as a template; and 

iii) discerning the presence of the RM in the subject rice individual or the seed thereof based on polymorphism 
found in said nucleic acid amplification product. 

[0114] The detection of polymorphisms in the nucleic acid amplification product is performed by, but not limited to, 
discerning the subject rice individual or seed to have the RM gene when one to all of the requirements 1)-8) below 
are met: 

1 ) a region including a base corresponding to the base 1 239 of SEQ ID NO: 27 does not have any Mbol recognition 
sequence; 

2) a region including a base corresponding to the base 6227 of SEQ ID NO: 27 does not have any Bsll recognition 
sequence; 
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3) a region including a base corresponding to the base 20680 of SEQ ID NO: 27 does not have any Mbol recognition 
sequence; 

4) a region including a base corresponding to the base 45461 of SEQ ID NO: 27 does not have any Taql recognition 
sequence; 

5 5) a region including a base corresponding to the base 49609 of SEQ ID NO: 27 does not have any BstUI recognition 

sequence; 

6) a region including a base corresponding to the base 56368 of SEQ ID NO: 27 does not have any Mspl recognition 
sequence; 

7) a region including a base corresponding to the base 57629 of SEQ ID NO: 27 does not have any BsaJI recognition 
10 sequence; and 

8) a region including a base corresponding to the base 66267 of SEQ ID NO: 27 does not have any Xbal recognition 
sequence. 

[01 15] However, the present invention is not limited to the restriction enzymes above so far as each polymorphism 
15 jn the specific regions 1)-8) above can be detected. 

[0116] Preferably, identification methods of the present invention comprise: 

i) preparing a pair of primers based on a base sequence of adjacent regions including any one of the following base; 

20 1 ) a base corresponding to the base 45461 ; or 

2) a base corresponding to the base 49609; to amplify both the base of the above and adjacent regions thereto; 

ii) performing nucleic acid amplification reaction(s) using the genome DNA of the subject rice individual or the seed 
thereof as a template; and 

25 jjj) discerning the presence of the RM in the subject rice individual or the seed thereof based on polymorphism 

found in said nucleic acid amplification product. The subject rice individual or seed thereof is determined to have 
the RM gene in step iii), although not limited to, when at least one of the following requirements 1 ) and 2) is met: 

1) a region including a base corresponding to the base 45461 of SEQ ID NO: 27 does not have any Taql 
30 recognition sequence; 

2) a region including a base corresponding to the base 49609 of SEQ ID NO: 27 does not have any BstUI 
recognition sequence. 

[0117] Primer pairs used for the amplification reaction can be appropriately selected by those skilled in the art to 
35 preferably satisfy the conditions above on the basis of the base sequence of SEQ ID NO: 27. Preferably, any primer 
pair having a base sequence selected from the group consisting of SEQ ID NOS: 39 and 40, SEQ ID NOS: 41 and 42, 
SEQ ID NOS: 43 and 44, SEQ ID NOS: 45 and 46, SEQ ID NOS: 47 and 48, SEQ ID NOS: 49 and 50, SEQ ID NOS: 
51 and 52, and SEQ ID NOS: 53 and 54 is used. More preferably, the primer pair is selected from the group consisting 
of SEQ ID NOS: 45 and 46, and SEQ ID NOS: 47 and 48. If necessary, the sequences of the above primer pairs 
40 containing substitutions, deletions or additions while retaining the binding specificity for the base sequence of the 
polymorphic site and adjacent regions thereto can also be used as primers. 

[0118] To examine the resulting PCR product for restriction fragment length polymorphisms, it is cleaved with re- 
striction enzymes corresponding to the restriction sites present in PCR markers. Such cleavage is accomplished by 
incubation for several hours to a day at the recommended reaction temperature for the restriction enzymes used. The 
45 PCR amplified sample cleaved with the restriction enzymes can be analyzed by electrophoresis on an about 0.7% - 
2% agarose gel or an about 3% MetaPhor™ agarose gel. The gel is visualized under UV light in ethidium bromide, for 
example. 

[0119] In the most preferred embodiments of the present invention, restriction enzyme cleavage patterns show the 
bands as shown in Table 2 below on the visualized gel depending on the primer pair used. 



Table 2 


i Approximate size (bp) of detected band 


Amplified with P4497 Mobl (SEQ ID NOS: 39 and 40) Restriction enzyme 
Mbol 


j 730 

i 


Test rice genome having the Rf-1 gene (homozygous) 
no 


385, 345 
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Table 2 (continued) 







Approximate size (bp) of detected band 


5 


Amplified with P4497 Mobl (SEQ ID NOS: 39 and 40) Restriction enzyme 
Mbol 


730 




Amplified with P9493 Bsll (SEQ ID NOS: 41 and 42) Restriction enzyme 
Bsll 


126 




Test rice genome having the Rf-1 gene (homozygous) 




10 


no 


100,26 




Amplified with P23945 Mbol (SEQ ID NOS: 43 and 44) Restriction 
enzyme Mbol 




15 


Test rice genome having the Rf-1 gene (homozygous) 
no 


160, 100 
260 




Amplified with P41030 Taql (SEQ ID NOS: 45 and 46) Restriction 
enzyme Taql 




20 


Test rice genome having the Rf-1 gene (homozygous) 
no 


280 
90, 190 




Amplified with P45177 BstUI (SEQ ID NOS: 47 and Restriction enzyme 
BstUI 


48) 




Test rice genome having the Rf-1 gene (homozygous) 
no 


20,65,730 
20,65,175,555 




Amplified with B60304 Mspl (SEQ ID NOS: 49 and 50) Restriction 
enzyme Mspl 




30 


Test rice genome having the RM gene (homozygous) 
no 


330 

220, 110 




Amplified with B59066 BsaJI (SEQ ID NOS: 51 and Restriction enzyme 
BsaJI 52) 




35 


Test rice genome having the RM gene (homozygous) 
no 


420 

65, 355 




Amplified with B56691 Xbal (SEQ ID NOS: 53 and 54) Restriction 
enzyme Xbal 




40 


Test rice genome having the Rf-1 gene (homozygous) 
no 


670 

140,530 



[01 20] In Example 3 below, recombinants proximal to the RM gene having pollen fertility (RS1-RS2, RC1-RC8) were 
tested for the chromosomal organization of the RM region using 1 4 polymorphic markers including the 8 primer pairs 
described above. As a result, it was confirmed that all the plants carry the RM gene derived from the indica variety 
between P9493 Bsll and 59066 BsaJI. This result showed that recombinant pollens having the chromosomal organi- 
zation as shown in Fig. 3 have pollen fertility, i ,e. the RM gene is functional in these pollens. This means that a sequence 
determining the presence of the function of the Rf-1 gene is included in the indica region common to these recombinant 
pollens, i.e. in a region from the P4497 Mbol to B56691 Xbal loci (about 65 kb) as estimated at maximum. 
[01 21 ] In the present invention, chromosomal walking was started on the presumption that the S1 2564 Tsp509l locus 
should be vary proximal to the RM locus as judged from the frequency of appearance of individuals by crossing. In 
fact, the genetic distance between both loci has been calculated to be about 0.04 cM as the result of the high-precision 
segregation analysis of the present invention. Even one of markers known to be most closely linked to the RM locus 
as described in Japanese Patent Public Disclosure No. 2000-139465 is reported to have a genetic distance of 1 cM 
from the Rf-1 locus. Considering that 1 cM is estimated to be equivalent to 300 kb on average in rice, a considerable 
time should be required to restrict the Rf-1 gene region if chromosomal walking were started from the marker described 
in Japanese Patent Public Disclosure No. 2000-139465. 
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VI. Method for inhibiting the function of RM gene to restore fertility 

[0122] According to the present invention, the nucleic acid containing the locus of a fertility restorer gene (Rf-1) 
including the nucleic acids which function to restore fertility was isolated. The entire base sequence thereof was de- 

5 termined, whereby the fertility restoring function of the Rf-1 gene can be controlled by genetic engineering techniques. 
Thus, the present invention further provides a method for inhibiting the function of RM to restore fertility. 
[0123] A method for inhibiting the function of the RM gene to restore fertility according to one embodiment of the 
present invention comprises introducing an antisense having at least 100 continuous bases in length, and having a 
base sequence complementary to a nucleic acid having the base sequence of SEQ ID N0.27, or to a nucleic acid 

10 having a base sequence which is identical to at least 70% of the base sequence of SEQ ID NO.27, and which functions 
to restore fertility. 

[0124] In an embodiment, the method for inhibiting the function of the Rf-1 gene to restore fertility according to the 
present invention comprises introducing an antisense having at least 100 continuous bases in length, and being se- 
lected from base sequences complementary to a nucleic acid having the base sequence of bases 38538-541 23, pref- 

15 erably bases 42357-53743, more preferably bases 421 32-48883 of SEQ ID NO: 27, or to a nucleic acid having a base 
sequence which is identical to at least 70% of the base sequence of bases 38538-54123, preferably bases 
42357-53743, more preferably bases 42132-48883 of SEQ ID NO: 27 and, which functions to restore fertility. 
[0125] The antisense has a length of at least 1 00 bases or more, more preferably 500 bases or more, most preferably 
1000 bases or more. From the viewpoint of technical convenience of introduction, it preferably has a length of 10000 

20 bases or less, more preferably 5000 bases or less. The antisense can be synthesized by known methods. The antisense 
can be introduced into rice by known methods as described in e.g. Terada et al. (Plant Cell Physiol. 2000 Jul, 41(7), 
pp. 881-888). 

[01 26] It is also anticipated that Rf-1 disrupted lines can be established by screening variant lines containing a trans- 
posable element such as, but not limited to, Tos17 (Hirochika H. et al. 1996, Proc. Natl. Acad. Sci. USA 93, pp. 
25 7783-7788) for a line containing the transposable element in the base sequence of SEQ ID NO: 27. In plants, gene 
disruption by homologous recombination has. been studied. It may also be possible to inhibit fertility restoring function 
by establishing such a line in which the Rf-1 gene has been replaced by a variant RM gene using a nucleic acid having 
the base sequence of SEQ ID NO.27, or a nucleic acid having a base sequence which is identical to at least 70% of 
the base sequence of SEQ ID NO.27. 

30 
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Examples 

55 

[0128] The following examples further illustrate the present invention but are not intended to limit the technical scope 
of the invention. Those skilled in the art can readily add modifications/changes to the present invention on the basis 
of the description of the specification, and those modifications/changes are included in the technical scope of the 
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present invention. 
Reference examples 

[0129] The following reference examples are based on the examples described In our prior application (Japanese 
Patent Application No. 2000-247204 filed August 1 7, 2000). 

Reference example 1 ; Conversion of RFLP markers around Rf-1 gene to PCR markers 

[0130] In this reference example, nine RFLP markers (i.e., R1877, G291 , R2303, S12564, C1361 , S10019, G4003, 
S10602 and G2155) around the locus of Rf-1 gene were converted to PCR markers. 

(1) Materials and methods 

[0131] The following nine RFLP markers, R1877, G291, R2303, S12564, C1361, S10019, G4003, S10602 and 
G21 55, were purchased from the National Institute of Agrobiological Sciences, the Ministry of Agriculture, Forestry and 
Fisheries of Japan. After determining the base sequences of the inserts in the vectors, experiments were conducted 
according to the following procedures. Among rice varieties herein, Asominori belongs to japonica, and IR24 belongs 
to indica. 

(2) Preparation of Asominori genomic library 

[0132] Total DNA was extracted from green leaves of Asominori by the CTAB method. After partial digestion with 
Mbol, the DNA was fractionated according to size by NaCI density gradient centrifugation (6-20% linear gradient, 20°C, 
37,000 rpm, 4 hr, total volume = 12 mL). A portion of each fraction (about 0.5 mL) was subjected to electrophoresis 
and fractions containing 15-20 kb DNA were collected and purified. A library was constructed using Lambda DASH II 
(Stratagene) as a vector in accordance with the attached protocol. Giga Pack III Gold (Stratagene) was used for pack- 
aging. After packaging, 500 u.L of SM Buffer and 20 \xL of chloroform were added. After centrifugation, 20 ^L of chlo- 
roform was added to the supernatant to make a library solution, 

[0133] XL-1 Blue MRA (P2) was infected with 5 \lL of a 50-fold dilution of the library solution, whereupon 83 plaques 
were formed. This corresponded to 4.15 x 10 5 pfu per library, and hence, it was calculated that the plaques covered 
8.3 x 1 0 9 bp assuming that the average length of the inserted fragments was 20 kb. The library was therefore considered 
to have an adequate size for the rice genome (4 x 10 8 bp). 

(3) Isolation of genomic clones containing R1877-, C1361- and G4003-marker regions. 

[01 34] As for C1 361 and G4003, plasmids containing the RFLP marker probe were isolated and subjected to restric- 
tion enzyme treatment and electrophoresis to separate the RFLP marker probe portion; the desired DNA was recovered 
on a DNA recovery filter (Takara SUPREC-01). As for R1877, primers were designed that were specific to both ends 
of the marker probe and PCR was performed with the total DNA of Asominori used as a template; the amplification 
products were electrophoresed and recovered by the method described above. The recovered DNA was labelled with 
a Rediprime DNA Labelling System (Amersham Pharmacia) to prepare a probe for screening the library. PCR was 
performed in the usual manner (this also applies to the following description). 

[0135] Screening of the library was performed in the usual manner after blotting the plaques onto Hybond-N+ (Am- 
ersham Pharmacia). After primary screening, areas of positive plaques were individually punched out, suspended in 
SM buffer and subjected to the second round of screening. After the second screening, the positive plaques were 
punched out and subjected to the third round of screening to isolate a single plaque. 

[0136] The isolated plaque of interest was suspended in SM buffer and primary multiplication of the phage was 
performed by the plate lysate method. The resulting phage-enriched solution was subjected to secondary multiplication 
by shake culture and the phage DNA was purified with Lambda starter kit (QIAGEN). 

[01 37] For each marker, primary screening was conducted on eight plates. A 1 0 \iL aliquot of the library solution was 
employed per plate. After the primary, second and third rounds of screening, four genomic clones in association with 
R1 877 were isolated and three were isolated in association with each of C1361 and G4003. 

(4) Conversion of R1877 to PCR marker 

[01 38] The isolated genomic clones were analyzed to identify the causative site of RFLP, or the EcoRI site that exists 
in IR24 (indica rice) but not in Asominori Qaponica rice), thereby converting R1 877 to a PCR marker. 
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[0139] Specifically, the four isolated clones were subjected to the following analyses. First, T3 and T7 primers were 
used to determine the base sequences at both ends of the insert in each clone. Then, primers extending outwardly 
from both ends of the marker probe were designed. They were combined with T3 and T7 primers to give a combination 
of four primers in total, and employed in PCR with each clone used as the template. 
5 [0140] In a separate step, each clone was digested with Notl and EcoRI, and electrophoresed to estimate the insert 
size and the length of each EcoRI fragment. 

[0141] These analyses revealed the relative positions of the individual clones. In RFLP analysis, marker probe R1877 
was reported to detect an EcoRI fragment of 20 kb in Nipponbare (japonica rice) and one of 6.4 kb in Kasalath (indica 
rice) (ftp://ftp.staff.or.jp/pub/geneticmap98/parentsouthern/chrl 0/R1 877.JPG). This fact, taken together with the results 

10 of analysis described above, gave a putative position for the EcoRI site that existed in I R24 but not in Asominori. Hence, 
a primer combination (SEQ ID NO:1 x SEQ ID NO:2) that was designed to amplify the nearby region was employed 
to perform genomic PCR over 30 cycles, each cycle consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 2 min. The 
PCR product was treated with EcoRI and subjected to electrophoresis on 0.7% agarose gel. 
[0142] As a result, the expected polymorphisms were observed between Asominori and IR24. By treatment with 

15 EcoRI, the PCR product (-3200 bp) was cleaved to yield 1500 bp and 1700 bp fragments in IR24 but not in Asominori. 
Mapping of the marker was made with an RIL (recombinant inbred line) of Asominori-IR24 with the results that the 
PCR marker was located in the same region as that of RFLP marker locus R1877, thereby confirming the conversion 
of RFLP marker R1 877 to a PCR marker, which was named R1877 EcoRI in the present invention. 

20 (5) Conversion of G4003 to PCR marker 

[0143] The isolated genomic clones were analyzed to identify the causative site of RFLP, or the Hindlll site that 
existed in Asominori but not in IR24, thereby converting G4003 to a PCR marker. 

[01 44] By performing analyses similar to those employed for R1 877, the relative positions of the three isolated clones 
25 were revealed. In RFLP analysis, marker probe G4003 was reported to detect a Hindlll fragment of 3 kb in Nipponbare 
(japonica rice) and one of 10 kb in Kasalath (indica rice) (ftp://ftp.staff.or.jp/pub/geneticmap98/parentsouthern/chrl 
0/R1877.JPG). This report, taken together with the analyses described above, led to a temporary conclusion that the 
Hindlll site that existed in Asominori but not in IR24 would be at either one of two candidate sites. Hence, a primer 
combination (SEQ ID NOS: 3 and 4) that was designed to amplify the area in the neighborhood of each Hindlll site 
30 was employed to perform genomic PCR over 35 cycles, each cycle consisting of 94°C x 30 sec, 58°C x 30 sec and 
72°C x 30 sec. The PCR product was treated with Hindlll and subjected to electrophoresis on 2% agarose gel. As a 
result, the Hindlll site within the marker probe was found to have polymorphisms. By treatment with Hindlll, the PCR 
product (362 bp) was cleaved to yield a 95 bp fragment and a 267 bp fragment in Asominori but not in IR24. Mapping 
of the site demonstrated the conversion of RFLP marker G4003 to a PCR marker, which was named G4003 Hindlll 
35 (SEQ ID NO:19) in the present invention. 

(6) Conversion of C1361 to PCR marker 

[0145] Primers were designed on the basis of the base sequence information of the isolated genomic clones. PCR 
40 was performed with the total DNAs of Asominori and I R24 being used as a template and the PCR product was recovered 

by known methods after electrophoresis. Using the recovered DNA as a template, the inventors analyzed the base 

sequence of each of the rice varieties with ABI Model 31 0 in search of mutations that would cause polymorphisms. 

[0146] By performing analyses similar to those employed for R1877, approximate relative positions of the three 

isolated clones could be established. As it turned out, however, regions around the C1361 marker would be difficult to 
45 amplify by PCR or determine their base sequences, and hence, it would not be easy to identify the causative site of 

RFLP. Hence, the inventors took notice of the region capable of yielding a comparatively long PCR product (2.7 kb) 

and made an attempt to create a dCAPS marker. 

[01 47] Specifically, upon comparing the base sequences of the genomic PCR products of said region using Asominori 
and Koshihikari (both japonica rice) and Kasalath and IR24 (both indica rice), the inventors found six sites of polymor- 

50 phism between japonica and indica. One of these six sites was used to create a dCAPS marker To this end, with SEQ 
ID NO:5 and SEQ ID NO:6 used as primers, PCR was performed over 35 cycles, each cycle consisting of 94°C x 30 
sec, 58°C x 30 sec and 72°C x 30 sec. The PCR product was treated with Mwol and analyzed by electrophoresis on 
3% MetaPhor™ agarose gel. In Asominori, cleavage occurred at two sites to give three observable bands of about 25 
bp, 50 bp and 79 bp, but in IR24 cleavage occurred at one site to give two observable bands of about 50 bp and 107 

55 bp. Mapping demonstrated the conversion of RFLP marker C1361 to a PCR marker, which was named C1361 Mwol 
(SEQ ID NO:20) in the present invention. 
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(7) Conversion of G2155 to PCR marker 

[01 48] Primers specific to both ends of the marker probe were designed and PCR was performed with the total DNA 
of Asominori, Koshihikari, IR24 or IL216 (a line produced by introducing RM gene into Koshihikari by back crossing; 
s its genotype was Rf-1/RM) being used as a template. Purification of the PCR product and searching for a mutation 
that would be useful for providing restriction fragment polymorphisms were performed by the methods already described 
above. 

[0149] Specifically, as a result of comparing the base sequences of corresponding regions of the varieties under test, 
mutations were found at three sites between the variety/line (IR24 and IL21 6) having Rf-1 gene and the variety (As- 

10 ominori and Koshihikari) not having RM gene. One of the three sites was utilized to create a dCAPS marker. To this 
end, SEQ ID NO:7 and SEQ ID NO:8 were used as primers to perform PCR over 35 cycles, each cycle consisting of 
94°C x 30 sec, 58°C x 30 sec and 72°C x 30 sec. The PCR product was treated with Mwol and analyzed by electro- 
phoresis on 3% MetaPhor™ agarose gel. In Asominori, cleavage occurred at one site to give two observable bands 
of about 25 bp and 1 05 bp, but in IR24, cleavage occurred at two sites to give three observable bands of about 25 bp, 

'5 27 bp and 78 bp. Mapping demonstrated the conversion of RFLP marker G2155 to a PCR marker, which was named 
G2155 Mwol (SEQ ID NO:21) in the present invention. 

(8) Conversion of G291 to PCR marker 

20 [01 50] Primers specific to internal sequences of the marker probe were designed and used in various combinations 
to perform PCR to find a primer combination that could yield an amplification product of the expected size. Using the 
selected primer combination, the inventors performed PCR with the total DNA of Asominori, Koshihikari, IR24 and 
IL21 6 used as a template. Purification of the PCR product and searching for a mutation that could be utilized in providing 
restriction fragment polymorphisms were performed by the methods already described above. 

25 [01 51 ] Specifically, using the primers designed to be specific for the marker probe sequence, the inventors performed 
genomic PCR of each variety under test and compared the base sequences of the products. As a result, mutations 
were found at four sites between the variety/line having Rf-1 gene (IR24 and IL216) and the variety (Asominori and 
Koshihikari) not having Rf-1 gene. One of the four sites was used to create a dCAPS marker. To this end, SEQ ID NO: 
9 and SEQ ID NO:10 were used as primers to perform PCR over 35 cycles, each cycle consisting of 94°C x 30 sec, 

30 58°C x 30 sec and 72°C x 30 sec. The PCR product was treated with Mspt and analyzed by electrophoresis on 3% 
MetaPhor™ agarose gel. In the varieties/lines having RM gene, cleavage occurred at two sites to give three observable 
bands of about 25 bp, 49 bp and 55 bp, but in the varieties not having Rf-1 gene, cleavage occurred at one site to give 
two observable bands of about 25 bp and 104 bp. Mapping demonstrated the conversion of RFLP marker G291 to a 
PCR marker, which was named G291 Mspl (SEQ ID NO:22) in the present invention. 

35 

(9) Conversion of R2303 to PCR marker 

[0152] Primers specific to internal sequences of the marker probe were designed and PCR was performed with the 
total DNA of Asominori (japonica rice) and IR24 and Kasalath (indica rice) used as a template. Purification of the PCR 
^0 product and searching for a mutation that could be used for providing restriction fragment polymorphisms were per- 
formed by the methods already described above. 

[01 53] As a result of comparing the base sequences of corresponding regions of the varieties under test, a mutation 
was found between japonica rice and indica rice. Since the mutation occurred at the Bsll recognition site, the site was 
directly used to create a CAPS marker. To this end, SEQ ID NO:1 1 and SEQ ID NO:12 were used as primers and PCR 
45 was performed over 30 cycles, each cycle consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 2 min. The PCR product 
was treated with Bsll and analyzed by electrophoresis on 2% agarose gel. In japonica rice, cleavage occurred at one 
site to give two observable bands of about 238 bp and 1334 bp, but in indica rice, cleavage occurred at two sites to 
give three observable bands of about 238 bp, 655 bp and 679 bp. Mapping demonstrated the conversion of RFLP 
marker R2303 to a PCR marker, which was named R2303 Bsll (SEQ ID NO:23) in the present invention. 

50 

(1 0) Converting S1 001 9 to PCR marker 

[0154] S1 001 9 was converted to a PCR marker in accordance with the method (9) of converting R2303 to a PCR 
marker. 

55 [01 55] Specifically, as a result of comparing the base sequences of corresponding regions of the varieties under test, 
a mutation was found between japonica rice and indica rice. Since the mutation occurred at the BstUI recognition site, 
the site was directly used to create a CAPS marker. To this end, SEQ ID NO:13 and SEQ ID NO:14 were used as 
primers and PCR was performed over 30 cycles, each cycle consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 1 
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min. The PCR product was treated with BstUI and analyzed by electrophoresis on 2% agarose gel. In japonica rice, 
cleavage occurred at one site to give two observable bands of about 130 bp and 462 bp, but in indica rice, cleavage 
occurred at two sites to give three observable bands of about 130 bp, 21 8 bp and 244 bp. Mapping demonstrated the 
conversion of RFLP marker S1 001 9 to a PCR marker, which was named S1 001 9 BstUI (SEQ ID NO:24) in the present 
5 invention. 

(11) Conversion of S1 0602 to PCR marker 

[0156] S10602 was converted to a PCR marker in accordance with the method (9) of converting R2303 to a PCR 
*0 marker. 

[0157] Specifically, as a result of comparing the base sequences of corresponding regions of the varieties under test, 
a mutation was found between japonica rice and indica rice. The mutation was used to create a CAPS marker. To this 
end, SEQ ID NO:15 and SEQ ID NO:16 were used as primers and PCR was performed over 33 cycles, each cycle 
consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 1 min. The PCR product was treated with Kpnl and analyzed by 
'5 electrophoresis on 2% agarose gel. In japonica rice, cleavage occurred at one site to give two observable bands of 
about 117 bp and 607 bp, but in indica rice, no cleavage occurred, giving only an observable band of 724 bp. 
Mapping demonstrated the conversion of RFLP marker S10602 to a PCR marker, which was named S10602 Kpnl 
(SEQ ID NO:25) in the present invention. 

20 (12) Conversion of S1 2564 to PCR marker 

[01 58] S1 2564 was converted to a PCR marker in accordance with the method of converting R2303 to a PCR marker. 
[01 59] Specifically, as a result of comparing the base sequences of corresponding regions of the varieties under test, 
a mutation was found between japonica rice and indica rice. The mutation was used to create a dCAPS marker. To 

25 this end, SEQ ID NO:1 7 and SEQ ID NO:1 8 were used as primers and PCR was performed over 35 cycles, each cycle 
consisting of 94°C x 30 sec, 58°C x 30 sec and 72°C x 30 sec. The PCR product was treated with Tsp509l and analyzed 
by electrophoresis on 3% MetaPhor™ agarose gel. In japonica rice, cleavage occurred at two sites to give three ob- 
servable bands of 26 bp, 41 bp and 91 bp, but in indica rice, cleavage occurred at one site to give two observable 
bands of 41 bp and 117 bp. Mapping demonstrated the conversion of RFLP marker S12564 to a PCR marker, which 

30 was named S12564 Tsp509l (SEQ ID NO:26) in the present invention. 

Reference example 2: Mapping of Rf-1 Gene Locus 

[0160] DNA was extracted from 1042 seedlings of the F1 population produced by pollinating MS Koshihikari with 
35 MS-FR Koshihikari, and the DNA extract was used in the analysis. MS Koshihikari (generation: BC10F1 ) was created 
by replacing the cytoplasm of Koshihikari with BT type male sterility cytoplasm. MS-FR Koshihikari was a line created 
by introducing Rf-1 gene from IR8 (supplied from National Institute of Agrobiological Sciences) into MS Koshihikari 
(the locus of Rf-1 gene being heterozygous). 

[0161] First, each individual was investigated for the genotype at two marker loci R1877 EcoRI and G2155 Mwol 
^0 described in Reference example 1 that would presumably be located on opposite sides of the locus of Rf-1 gene. 
Japonica type homozygotes with respect to either locus R1877 EcRI or G2155 Mwol were regarded as recombinants 
between these two marker loci. Then, each of such recombinants was investigated for the genotypes of G291 Mspl, 
R2303 Bsll, S12564 Tsp 509I, C1361 Mwol, S10019 BstUI, G4003 Hindlll and S1 0602 Kpnl loci, and the positions of 
recombination were identified. 

45 [0162] The genotype investigation with respect to R1 877 EcoRI and G2155 Mwol loci revealed that 46 individuals 
were recombinants around the locus of Rf-1 gene. Genotypes of the marker loci around the locus of Rf-1 gene were 
investigated and the results are shown in Table 3. 
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Table 3. Genotypes of Marker Loci in Recombinant 
Individuals Around Rf-1 Locus 
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J: Homozygous Koshihikarl type 

H: Heterozygous Koshihikari type/ MS-FR Koshihikari type 

[0163] As shown in Table 3, recombinant 8 homozygous for japonica at the S12564 Tsp509l marker locus and re- 
combinants 9 and 1 0 homozygous for japonica at the C1 361 Mwo marker locus were obtained. As all of these recom- 
binants restored fertility, the former was regarded as a recombinant between the Rf-1 and S1 2564 Tsp509l loci while 
the latter were regarded as recombinants between the Rf-1 and C1361 Mwol loci, showing that the Rf-1 gene is located 
between the S12564Tsp509l and C1361 Mwol loci. Based on the report that only pollens carrying the Rf-1 gene have 
fertility in individuals having the BT type male sterile cytoplasm in the cross above (C.Shinjyo, JAPAN J. GENETICS 
Vol.44, No. 3:149-156(1969)), the Rf-1 gene locus could be located on a detailed linkage map (Fig. 4). 

Example 1 : Acquisition of recombinant individuals proximal to the RM locus 

(Materials and Methods) 

[0164] DNA was extracted from each of 4103 individuals of BC10F1 population produced by pollinating MS Koshi- 
hikari (generation: BC10F1) with MS-FR Koshihikari (generation: BC9F1 , heterozygous at the RM locus), and geno- 
typed at the S12564 Tsp509l and C1361 Mwol loci in the same manner as described in Reference example 2 above. 
Individuals having a genotype homozygous for Koshihikari at the S12564 Tsp509l locus were regarded as those gen- 
erated by recombination between the Rf-1 and S12564Tsp509l loci, while individuals having a genotype homozygous 
for Koshihikari at the C1361 Mwol locus were regarded as those generated by recombination between the RM and 
C1361 Mwol loci. 

(Results and Discussion) 

[0165] A survey of 4103 individuals revealed one recombinant individual between the RM and S12564Tsp509l loci 
and 6 recombinant individuals between the Rf-1 and C1 361 Mwol loci. The previous survey of 1 042 individuals obtained 
by crossing in Reference example 2 above had already revealed one recombinant individual between the RM and 
S12564 Tsp509l loci and 2 recombinant individuals between the RM and C1361 Mwol loci as shown in Table 3. 
[0166] Thus, a total of 2 recombinant individuals between the RM and S12564 Tsp509l loci and 8 recombinant 
individuals between the Rf- 1 and C 1 361 Mwol loci were able to be oabtained from 51 45 individuals. These 1 0 individuals 
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were tested by high-precision segregation analysis in the examples below. 
Example 2 chromosomal walking 
(1) First chromosomal walking 
(Materials and Methods) 

[0167] A genomic library was constructed from the genomic DNA of Asominori japonica (not carrying Rf-1) using 
Lambda DASH II vector as described in Reference example 1 and tested by chromosomal walking. 
[0168] PCR was routinely performed using total DNA of Asominori as a template in combination with the following 
primer pair: 



5 ' -atcaggagccttcaaattgggaac-3 ' (SEQ ID NO: 29) 

and 



5 9 ~ctcgcaaattgottaattttgacc-3 ' (SEQ ID NO: 30) 

designed for a partial base sequence (Accession No. D47284) of RFLP probe S12564. The resulting amplification 
products of about 1200 bp were electrophoresed on an agarose gel and then purified by QIAEXII (QIAGEN). The 
purified DNA was labeled with a rediprime DNA labelling system (Amersham Pharmacia) to give a library screening 
probe (probe A, Fig. 1). 

[0169] The library was routinely screened after plaques were blotted onto Hybond-N+ (Amersham Pharmacia). Single 
plaques were separated, after which phage DNA was purified by the plate lysate method using Lambda Midi kit (QIA- 
GEN). 

(Results and Discussion) 

[0170] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
two (WSA1 and WSA3) of 4 clones obtained by screening were in a relative position as shown in Fig. 1 . The Asominori 
genomic base sequences corresponding to WSA1 and WSA3 were determined by primer walking (DNA Sequencer 
377, ABI). 

(2) Second chromosomal walking 
(Materials and Methods) 

[0171] In addition to the Asominori genomic library described above, an IR24 genomic library was similarly construct- 
ed from the genomic DNA of an indica variety IR24 (carrying Rf-1) and tested by chromosomal walking. 
[01 72] PCR was routinely performed using DNA of WSA3 as a template in combination with the following primer pair: 

5' -tgaaggagttatgggtgcgtgacg-3 9 (SEQ ID NO: 31) 

and 



5 ' -ttgccgagcacacttgccatgtgc-3 ' (SEQ ID NO: 32) 

designed for the Asominori genomic base sequence determined in (1). The resulting amplification products of 
524 bp were purified and labeled by the method described above to give a library screening probe (probe E, Fig. 1). 
[0173] Library screening and phage DNA purification were performed by the method described above. 
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(Results and Discussion) 

[0174] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
one (WSE8) of 15 clones obtained by screening of the Asominori genomic library was in a relative position as shown 
in Fig. 1 . The Asominori genomic base sequence corresponding to WSE8 was determined by primer walking. 
[01 75) The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
two (XSE1 and XSE7) of 7 clones obtained by screening of the IR24 genomic library were in a relative position as 
shown in Fig. 1. The IR24 genomic base sequences corresponding to XSE1 and XSE7 were determined by primer 
walking. 

(3) Third chromosomal walking 
(Materials and Methods) 

[0176] The Asominori genomic library and IR24 genomic library described above were tested by chromosomal walk- 
ing. 

[01 77] PC R was routinely performed using DNA of WSE8 as a template in combination with the following primer pair: 
5 ' -gcgacgcaatggacatagtgctcc-3 9 (SEQ ID NO: 33) 

and 



5 ' -ttacctgccaagcaatatccatcg-3 • (SEQ ID NO: 34) 

designed for the Asominori genomic base sequence determined in (2). The resulting amplification products of 
1 1 59 bp were purified and labeled by the method described above to give a library screening probe (probe F, Fig. 1 ). 
[01 78] Library screening and phage DNA purification were performed by the method described above. 

(Results and Discussion) 

[0179] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
two (WSF5 and WSF7) of 8 clones obtained by screening of the Asominori genomic library were in a relative position 
as shown in Fig. 1 . The Asominori genomic base sequences corresponding to WSF5 and WSF7 were determined by 
primer walking. 

[01 80] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
two (XSF4 and XSF20) of 13 clones obtained by screening of the IR24 genomic library were in a relative position as 
shown in Fig. 1 . The IR24 genomic base sequences corresponding to XSF4 and XSF20 were determined by primer 
walking. 

(4) Fourth chromosomal walking 
(Materials and Methods) 

[0181] The Asominori genomic library and IR24 genomic library described above were tested by chromosomal walk- 
ing. 

[01 82] PCR was routinely performed using DNA of WSF7 as a template in combination with the following primer pair: 
5 ' -aaggcatactcagtggagggcaag-3 ' (SEQ ID NO: 35) 

and 



5 ' -ttaacctgaccgcaagcacctgtc-3 ' (SEQ ID NO: 36) 

designed for the Asominori genomic base sequence determined in (3). The resulting amplification products of 
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456 bp were purified and labeled by the method described above to give a library screening probe (probe G, Fig. 1). 
[0183] Library screening and phage DNA purification were performed by the method described above. 

(Results and Discussion) 

[0184] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
two (WSG2 and WSG6) of 6 clones obtained by screening of the Asominori genomic library were in a relative position 
as shown in Fig. 1 . The Asominori genomic base sequences corresponding to WSG2 and WSG6 were determined by 
primer walking. 

[0185] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
three (XSG8, XSG16 and XSG22) of 14 clones obtained by screening of the IR24 genomic library were in a relative 
position as shown in Fig. 1. The IR24 genomic base sequences corresponding to XSG8, XSG16 and XSG22 were 
determined by primer walking. 

(5) Fifth chromosomal walking 

(Materials and Methods) 

[0186] The IR24 genomic library described above was tested by chromosomal walking. 

[01 87] We perused the public website of TIGR (The Institute for Genomic Research) and found that a BAC (Bacterial 
Artificial Chromosome) clone (Accession No. AC068923) containing RFLP marker S12564 had been deposited with 
a public database (GenBank). This BAC clone contains the genomic DNA of Nipponbare japonica and it was shown 
from base sequence comparison to completely include the contig regions of Asominori and IR24 prepared in (1 W4) 
(Fig. 2). ~ ' K K M ' 

[0188] Thus, PCR was routinely performed using total DNA of IR24 as a template in combination with the following 
primer pair: 



5'-tggatggactatgtggggtcagtc-3' (SEQ ID NO: 37) 



and . 



5 ' -agtggaagtggagagagtagggag-3 ' (SEQ ID NO; 38) 

designed to amplify a part of this BAC clone. The resulting amplification products of about 600 bp were purified and 

labeled by the method described above to give a library screening probe (probe H, Fig. 1). 

[0189] Library screening and phage DNA purification were performed by the method described above. 

(Results and Discussion) 

[0190] The results of terminal base sequence analysis and restriction enzyme fragment length analysis showed that 
one (XSH18) of 15 clones obtained by screening of the IR24 genomic library was in a relative position as shown in 
Fig. 1 . The IR24 genomic base sequence corresponding to XSH18 was determined by primer walking. 

Example 3: High precision segregation analysis 

(1) Development of PCR marker P4497 Mbol 

[0191] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) and 
the genomic base sequence corresponding to the Asominori contig (SEQ ID NO: 28) determined in Example 2 revealed 
that the 1239th base.of SEQ ID NO: 27 is A while the 12631st base of SEQ ID NO: 28 corresponding to said position isG. 
[0192] For detecting this change, fragments of about 730 bp are first amplified by PCR from a region surrounding 
said position using the following primer pair: P4497 Mbol F: 
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5' -ccctccaacacataaatggttgag-3' (SEQ ID NO: 39) 



(corresponding to bases 853-876 of SEQ ID NO: 27) 
(corresponding to bases 12247-12270 of SEQ ID NO: 28) and 
P4497 Mbol R: 



5' -tttctgccaggaaactgttagatg-3' (SEQ ID NO: 40) 



(corresponding to bases 1583-1560 of SEQ ID NO: 27) 

(corresponding to bases 12975-12952 of SEQ ID NO: 28). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Mbol. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after Mbol treatment because the amplification products from Asominori DNA having an Mbol recognition sequence 
(GATC) are cleaved with Mbol while the amplification products from IR24 DNA are not cleaved with Mbol for the lack 
of the Mbol recognition sequence. 

(2) Development of PCR marker P9493 Bsll 

[0193] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) and 
the genomic base sequence corresponding to the Asominori contig (SEQ ID NO: 28) determined in Example 2 revealed 
that the 6227th base of SEQ ID NO: 27 is A while the 1 7627th base of SEQ ID NO: 28 corresponding to said position is C. 
[0194] For detecting this change, fragments of 126 bp are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
P9493 Bsll F: 



5' -gcgatcttatacgcatact:atgcg-3' (SEQ ID NOs 41) 

(corresponding to bases 6129-6152 of SEQ ID NO: 27) 
(corresponding to bases 1 7529-1 7552 of SEQ ID NO: 28) and 
P9493 Bsll R: 



5 ' -aaagtctttgttccttcaccaagg-3 ' (SEQ ID NO: 42) 

(corresponding to bases 6254-6231 of SEQ ID NO: 27) 

(corresponding to bases 1 7654-1 7631 of SEQ ID NO: 28). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Bsll. Thus, the 
change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA after 
Bsll treatment because the amplification products from Asominori DNA having a Bsll recognition sequence (CC- 
NNNNNNNGG) are cleaved with Bsll while the amplification products from IR24 DNA are not cleaved with Bsll for the 
lack of the Bsll recognition sequence. 

[0195] This marker was developed by applying the dCAPS method (Michaels and Amasino 1998, Neff et al., 1998). 
Specifically, g is substituted for a at the base 6236 of SEQ ID NO: 27 and the base 17636 of SEQ ID NO: 28 by the 
use of P9493 Bsll R primer described above. Thus, the fragments from Asominori DNA come to have a sequence of 
CCtttccttGG at 1 7626-1 7636 of SEQ ID NO: 28 so that they are cleaved with Bsll. 

(3) Development of PCR marker P23945 Mbol 

[0196] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) and 
the genomic base sequence corresponding to the Asominori contig (SEQ ID NO: 28) determined in Example 2 revealed 
that the 20680th base of SEQ ID NO: 27 is G while the 32079th base of SEQ ID NO: 28 corresponding to said position 

[0197] For detecting this change, fragments of 260 bp are first amplified by PCR from a region surrounding said 
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position using the following primer pair: P23945 Mbol F: 



5' -gaggatttatcaaaacaggatggacg-3 9 (SEQ ID NO: 43) 

(corresponding to bases 20519-20544 of SEQ ID NO: 27) 
(corresponding to bases 31918-31943 of SEQ ID NO: 28) and 
P23945 Mbol R: 



5' -tgggcggcagcagtggaggataga-3' (SEQ ID NO: 44) 



(corresponding to bases 20778-20755 of SEQ ID NO: 27) 

(corresponding to bases 32177-32154 of SEQ ID NO: 28). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Mbol. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after Mbol treatment because the amplification products from IR24 DNA having an Mbol recognition sequence (GATC) 
are cleaved with Mbol while the amplification products from Asominori DNA are not cleaved with Mbol for the lack of 
the Mbol recognition sequence. 

(4) Development of PCR marker P41030 Taql 

[0198] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) and 
the genomic base sequence corresponding to the Asominori contig (SEQ ID NO: 28) determined in Example 2 revealed 
that the 45461 st base of SEQ ID NO: 27 is A while the 491 64th base of SEQ ID NO: 28 corresponding to said position 
is G. 

[0199] For detecting this change, fragments of 280 bp are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
P41 030 Taql F: 



5 ' -aagaagggagggttatagaatctg-3' (SEQ ID NO: 45) 



(corresponding to bases 45369-45392 of SEQ ID NO: 27) 
(corresponding to bases 49072-49095 of SEQ ID NO: 28) and 
P41 030 Taql R: 



5 ' -atatcaggactaacaccactgctc-3 ' (SEQ ID NO: 46) 



(corresponding to bases 45648-45625 of SEQ ID NO: 27) 

(corresponding to bases 49351-49328 of SEQ ID NO: 28). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Taql. Thus, the 
change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA after 
Taql treatment because the amplification products from Asominori DNA having a Taql recognition sequence (TCGA) 
are cleaved with Taql while the amplification products from IR24 DNA are not cleaved with Taql for the lack of the Taql 
recognition sequence. 

(5) Development of PCR marker P451 77 BstUI 

[0200] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) and 
the genomic base sequence corresponding to the Asominori contig (SEQ ID NO: 28) determined in Example 2 revealed 
that the 49609th base of SEQ ID NO: 27 is A while the 5331 1 st base of SEQ ID NO: 28 corresponding to said position 
isG. 

[0201] For detecting this change, fragments of 812 bp are first amplified by PCR from a region surrounding said 



position using the following primer pair: 
P451 77 BstUI F: 
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5' -acgagtagtagcgatcttccagcg-3' (SEQ ID NO: 47) 



(corresponding to bases 49355-49378 of SEQ ID NO: 27) 
(corresponding to bases 53057-53080 of SEQ ID NO: 28) and 
P45177 BstUI R: 



5' -cagcgtgaaactaaaaacggaggc-3' (SEQ ID NO: 48) 

(corresponding to bases 50166-50143 of SEQ ID NO: 27) 

(corresponding to bases 53868-53845 of SEQ ID NO: 28). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with BstUI. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after BstUI treatment because the amplification products from IR24 DNA having a BstUI recognition sequence (CGCG) 
at two positions are cleaved into 3 fragments with BstUI while the amplification products from Asominori DNA having 
the BstUI recognition sequence at three positions are cleaved with BstUI into four fragments. 

(6) Development of PCR marker B60304 Mspl 

[0202] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) deter- 
mined in Example 2 and the base sequence of the BAC clone described above (Accession No. AC068923) revealed 
that the 56368th base of SEQ ID NO: 27 is T while the base of AC068923 corresponding to said position is C. 
[0203] For detecting this change, fragments of about 330 bp are first amplified by PCR from a region surrounding 
said position using the following primer pair: 
B60304 Mspl F: 



5 ' -atcccacatcatcataatccgacc-3 ' (SEQ ID NO; 49) 

(corresponding to bases 56149-56172 of SEQ ID NO: 27) and 
B60304 Mspl R: 



5* -agcttctcccttggatacggtggcg-3 ' (SEQ ID NO: 50) 

(corresponding to bases 56479-56455 of SEQ ID NO: 27). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Mspl. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after Mspl treatment because the amplification products from Nipponbare DNA having an Mspl recognition sequence 
(CCGG) are cleaved with Mspl while the amplification products from IR24 DNA are not cleaved with Mspl for the lack 
of the Mspl recognition sequence. 

[0204] This marker was developed by applying the dCAPS method. Specifically, t is substituted for g at base 56463 
of SEQ ID NO: 27 by the use of B60304 Mspl R primer. As a result, the Mspl recognition sequence of bases 56460-56463 
of SEQ ID NO: 27 changes from CCGG into ccgt so that the fragments from SEQ ID NO: 27 become unable to be 
cleaved with Mspl. Thus, the fragments from IR24 have no Mspl recognition sequence, while DNA from Nipponbare 
has the Mspl recognition sequence at one position in a region corresponding to bases 56367-56370 of SEQ ID NO: 27. 

(7) Development of PCR marker B59066 BsaJI 

[0205] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) deter- 
mined in Example 2 and the base sequence of the BAG clone described above (Accession No. AC068923) revealed 
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that the 57629th base of SEQ ID NO: 27 is C while the base of AC068923 corresponding to said position is CC. 
[0206] For detecting this change, fragments of about 420 bp are first amplified by PCR from a region surrounding 
said position using the following primer pair: 
B59066 BsaJI F: 



5 f -atttgttggttagttgcggctgag-3' (SEQ ID NO: 51) 

(corresponding to bases 57563-57586 of SEQ ID NO: 27) and 
B59066 BsaJI R: 



5 ' -gcccaaactcaaaaggagagaacc-3 ' (SEQ ID NO: 52) 

(corresponding to bases 57983-57960 of SEQ ID NO: 27). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with BsaJI. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after BsaJI treatment because the amplification products from Nipponbare DNA having a BsaJI recognition sequence 
(CCNNGG) are cleaved with BsaJI while the amplification products from IR24 DNA are not cleaved with BsaJI for the 
lack of the BsaJI recognition sequence. 

(8) Development of PCR marker B56691 Xbal 

[0207] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) deter- 
mined in Example 2 and the base sequence of the BAC clone described above (Accession No. AC068923) revealed 
that the 66267th base of SEQ ID NO: 27 is G while the base of AC068923 corresponding to said position is C. 
[0208] For detecting this change, fragments of about 670 bp are first amplified by PCR from a region surrounding 
said position using the following primer pair: 
B56691 Xbal F: 



5 " - cc t caag t ct cccct aaagccact - 3 ' (SEQ ID NO: 53) 

(corresponding to bases 66129-66152 of SEQ ID NO: 27) and 
B56691 Xbal R: 



5'-gctctactgctgataaaccgtgag-3' (SEQ ID NO; 54) 

(corresponding to bases 66799-66776 of SEQ ID NO: 27). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Xbal. Thus, the 
change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA after 
Xbal treatment because the amplification products from Nipponbare DNA having an Xbal recognition sequence (TCTA- 
GA) are cleaved with Xbal while the amplification products from IR24 DNA are not cleaved with Xbal for the lack of 
the Xbal recognition sequence. 

(9) Development of PCR marker B53627 Bst217l 

[0209] Comparison between the genomic base sequence corresponding to the IR24 contig (SEQ ID NO: 27) deter- 
mined in Example 2 and the base sequence of the BAC clone described above (Accession No. AC068923) revealed 
that the 69331 st base of SEQ ID NO: 27 is T while the base of AC068923 corresponding to said position is C. 
[0210] For detecting this change, fragments of about 620 bp are first amplified by PCR from a region surrounding 
said position using the following primer pair: 
B53627 BstZ17l F: 



5' -tggatggactatgtggggtcagtc-3' (SEQ ID NO: 55) 
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(corresponding to bases 68965-68988 of SEQ ID NO: 27) and 
B53627 BstZ17l R: 

5' -agtggaagtggagagagtagggag-3' (SEQ ID NO: 56) 

(corresponding to bases 69582-69559 of SEQ ID NO: 27). 
The amplification products can be visualized by electrophoresis on an agarose gel after treatment with BstZ17l. Thus, 
the change can be detected as a difference In mobility in the agarose gel due to the difference in the length of DNA 
after BstZ17l treatment because the amplification products from IR24 DNA having a BstZ17l recognition sequence 
(GTATAC) are cleaved with BstZ1 71 while the amplification products from Nipponbare DNA are not cleaved with BstZ1 71 
for the lack of the BstZ1 71 recognition sequence. 

(10) Development of PCR marker B40936 Msel 

[021 1 ] Development of all the following PCR markers (1 0)-(1 2) relates to a study of the base sequences correspond- 
ing to further downstream regions (3') of base 76363 at the 3*end of SEQ ID NO: 27. 

[021 2] The following primer pair was designed for the base sequence of the BAG clone described above (Accession 
No. AC068923): 

5'-tacgacgccatttcactccattgc-3' (SEQ ID NO: 57) 

and 

5' -catttctctatgggcgttgctctg-3 f (SEQ ID NO: 58). 

PCR was routinely performed using this primer pair in combination with total DNAs of MS-FR Koshihikari (genotype 
of the RM locus: Rf-1 Rf-1 ) and Koshihikari as templates. The resulting amplification products of about 1 300 bp were 
electrophoresed on an agarose gel and then purified by QIAEXil (QIAGEN). Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 (ABI) showed several polymorphisms. 

[0213] One of them can be detected by PCR amplification of a region surrounding said position using the followinq 
primer pair: 
B40936 Msel F: 

5' -acctgtaggtatggcaccttcaacac-3' (SEQ ID NO: 59) 

and 

B40936 Msel R: 

5 ' -ccaaggaacgaagttcaaatgtatgg-3 ' (SEQ ID NO: 60). 

The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Msel. Thus 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after Msel treatment because the amplification products from MS-FR Koshihikari (Rf-1 Rf-1) DNA having an Msel 
recognition sequence (TTAA) are cleaved with Msel while the amplification products from Koshihikari DNA are not 
cleaved with Msel for the lack of the Msel recognition sequence. 
[0214] This marker was developed by applying the dCAPS method. 

(11) Development of PCR marker B19839 Mwol 



[0215] The following primer pair was designed for the base sequence of the BAC clone described above (Accession 
No. AC068923): 
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5' -tgatgtgtttgggcatccctttcg-3' (SEQ ID NO: 61) 



and 



5' -gagataggggacgacagacacgac-3' (SEQ ID NO: 62). 

PCR was routinely performed using this primer pair in combination with total DNAs of MS-FR Koshihikari (genotype 
of the RM locus: Rf-1 RM) and Koshihikari as templates. The resulting amplification products of about 1200 bp were 
electrophoresed on an agarose gel and then purified by QIAEXII (QIAGEN). Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 (ABI) showed several polymorphisms. 

[0216] One of them can be detected by PCR amplification of a region surrounding said position using the following 
primer pair: 
B19839Mwol F: 



5 r -tcctatggctgtttagaaactgcaca-3 9 (SEQ ID NO: 63) 



and 

B19839 Mwol R: 



5' -caagttcaaacataactggcgttg-3' (SEQ ID NO: 64) # 

The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Mwol. Thus, 
the change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA 
after Mwol treatment because the amplification products from Koshihikari DNA having an Mwol recognition sequence 
(GCNNNNNNNGC) are cleaved with Mwol while the amplification products from MS-FR Koshihikari (Rf-1 Rf-1) DNA 
are not cleaved with Mwol for the lack of the Mwol recognition sequence. 
[0217] This marker was developed by applying the dCAPS method. 

(12) Development of PCR marker B2387 Bfal 

[021 8] The following primer pair was designed for the base sequence of the BAC clone described above (Accession 
No. AC068923): 



5'-cactgtcctgtaagtgtgctgtgc-3' (SEQ ID NO: 65) 

and 



5' -caagcgtgtgataaaatgtgacgc-3 ' (SEQ ID NO: 66). 

PCR was routinely performed using this primer pair in combination with total DNAs of MS-FR Koshihikari (genotype 
of the RM locus: Rf-1 Rf-1) and Koshihikari as templates. The resulting amplification products of about 1300 bp were 
electrophoresed on an agarose gel and then purified by QIAEXII (QIAGEN). Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 (ABI) showed several polymorphisms. 

[0219] One of them can be detected by PCR amplification of a region surrounding said position using the followinq 
primer pair: 
B2387 Bfal F: 



5' -tgcctactgccattactatgtgac-3' (SEQ ID NO: 67) 



and 

B2387 Bfal R: 
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5' -acatactaccgtaaatggtctctg-3' (SEQ ID NO: 68). 



The amplification products can be visualized by electrophoresis on an agarose gel after treatment with Bfal. Thus, the 
change can be detected as a difference in mobility in the agarose gel due to the difference in the length of DNA after 
Bfal treatment because the amplification products from Koshihikari DNA having an Bfal recognition sequence (CTAG) 
are cleaved with Bfal while the amplification products from MS-FR Koshihikari (Rf-1 Rf-1) DNA are not cleaved with 
Bfal for the lack of the Bfal recognition sequence. 

(13) Segregation analysis 

[0220] Two recombinants between the Rf-1 and S12564 Tsp509l loci (RS1 and RS2) and 8 recombinants between 
the Rf-1 and C1 361 Mwol loci (RC1 to RC8) obtained in Example 1 were genotyped at the 1 2 DNA marker loci developed 
in (1) to (12) above. The results are shown in Table 4 along with the genotypes of each recombinant at the S12564 
Tsp509l andC1361 Mwol loci. 



Table 4. 

Genotypes of recombinants proximal to the Rf-1 locus at various marker loci 



Locus 


RS1 


RS2 


RC1 


RC2 


RC3 


RC4 


RC5 


RC6 


RC7 


RC8 


S12564Tsp509l 


J 


J 


H 


H 


H 


H 


H 


H 


H 


H 


P4497 Mbol 


J 


J 


H 


H 


H 


H 


H 


H 


H 


H 


P9493 Bsll 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


P23945 Mbol 


H 


H 


H 


H 


H 


H 


H 


H 


i H 


H 


P41030Taql 


H 


H 


H 


H 


H 


H - 


H 


H 


H 


H 


P451 77 BstUI 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B60304 Mspl 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B59066 BsaJI 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B56691 Xbal 


H 


H 


H 


H . 


H 


H 


H 


J 


H 


H 


B53627 BstZ17l 


H 


H 


H 


H 


H 


H 


H 


J 


H 


H 


B40936 Msel 


H 


H 


H 


H 


H 


H 


H 


J 


H 


H 


B19839 Mwol 


H 


H 


H 


H 


H 


J 


H 


J 


H 


H 


B2387 Bfal 


H 


H 


H 


H 


H 


J 


R 


J 


H 


J 


C1361 Mwol 


H 


H 


J 


J 


J 


J 


J 


J 


J 


J 



J: Homozygous for Koshihikari 
H: Heterozygous for Koshihikari /MS-FR Koshihikari 



[0221] Table 4 shows that all the recombinants have an indica-derived Rf-1 chromosomal region between P9493 
Bsll and 59066 BsaJI. This result showed that recombinant pollens having the chromosomal organization as shown 
in Fig. 3 have pollen fertility, i.e. the Rf-1 gene is functional in these pollens. This means that a sequence determining 
the presence of the function of the Rf-1 gene is included in the indica region common to these recombinant pollens i 
e. in a region from the P4497 Mbol to B56691 Xbal loci (about 65 kb) as estimated at maximum. 
[0222] However, there is a possibility that it is important for the expression of the genetic function of the Rf-1 gene 
that the Rf-1 gene is partially of the indica genotype, and that the genetic function may not be significantly changed 
whether the remaining regions are of the japonica or indica genotype. Therefore, it cannot be concluded that the com- 
mon indica region above (bases 1239-66267 of SEQ ID NO: 27) completely contains the entire Rf-1 gene. However, 
it is thought that at least SEQ ID NO: 27 completely contains the entire Rf-1 gene for the following reasons: 

1) the size of a gene is normally several kilobases, and rarely exceeds 10 kb; 

2) the genomic base sequence of IR24 determined by the present invention (SEQ ID NO: 27) completely contains 
the common indica region above; 

3) the 5' end of SEQ ID NO: 27 is located 1238 bp upstream of the 5' end of the common indica region above and 



EP 1 437 409 A1 

forms a part of another gene (S 12564); and 

4) the 3' end of SEQ ID NO: 27 is located 1 0096 bp downstream of the 3' end of the common indica region above. 
Example 4: Complementation assay for a 9.7 kb fragment from XSE1 
(Materials and Methods) 

[0223] The X phage clone XSE1 (Figs. 1 and 5) was completely digested with Notl and electrophoresed on an agarose 
gel. The separated 9.7 kb fragment (including bases 1-9657 of SEQ ID NO: 27) was purified by QIAEXII (QIAGEN). 
[0224] On the other hand, an intermediate vector pSB200 having a hygromycin-resistant gene cassette was prepared 
on the basis of pSB11 (Komari et al., supra.). Specifically, a nopaline synthase terminator (Tnos) was first fused to a 
ubiquitin promoter and a ubiquitin intron (Pubi-ubil). A hygromycin-resistant gene (HYG(R)) was inserted between ubil 
and Tnos of the resulting Pubi-ubil-Tnos complex to give an assembly of Pubi-ubil-HYG(R)-Tnos. This assembly was 
fused to a Hindlll/EcoRI fragment of pSB1 1 to give pKY205. Linker sites for adding restriction enzyme sites Notl, NspV, 
EcoRV, Kpnl, Sad, EcoRI were inserted into the Hind III site upstream of Pubi of this pKY205 to give pSB200 having 
a hygromycin-resistant gene cassette. 

[0225] After the plasmid vector pSB200 was completely digested with Notl, DNA was recovered by ethanol precip- 
itation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKARA). The reaction 
solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the gel using QIAEXII 
(QIAGEN). 

[0226] The two fragments prepared above, i.e. a 9.7 kb fragment from XSE1 and a vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). After the reaction, DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in pure water (prepared by a Millipore system) and then mixed with 
E. coli DH5a cells, and the mixture was electroporated. After electroporation, the solution was cultured with shaking 
in LB medium (37°C, 1 hr) and then plated on an LB plate containing spectinomycin and warmed (37°C, 1 6 hr). Plasmids 
were isolated from 24 of the resulting colonies. Their restriction enzyme fragment length patterns and boundary base 
sequences were analyzed to select desired E. coli cells transformed with recombinant plasmids. 
[0227] The E. coli cells selected above were used for triparental mating with the Agrobacterium tumefaciens strain 
LBA4404/pSB1 (Komari et al., 1996) and the helper E. coli strain HB101/pRK2013 (Ditta et al., 1980) according to the 
method of Ditta et al. (1 980). Plasmids were isolated from 6 of the colonies formed on an AB plate containing spectin- 
omycin and their restriction enzyme fragment length patterns were analyzed to select desired Agrobacterium cells. 
[0228] The Agrobacterium cells selected above were used to transform MS Koshihikari (having BT cytoplasm and 
a nucleus gene substantially identical to Koshihikari) according to the method of Hiei et al. (1 994). Necessary immature 
seeds of MS Koshihikari for transformation can be prepared by pollinating MS Koshihikari with Koshihikari. 
[0229] Transformed plants were transferred to a greenhouse under long-day conditions after acclimation. 48 indi- 
viduals grown to a stage suitable for transplantation were transplanted into 1/5000a Wagner pots (4 individuals/pot) 
and transferred into a greenhouse under short-day conditions 3-4 weeks after transplantation. About one month after 
heading, seed fertility was tested on standing plants. 

(Results and Discussion) 

[0230] All of the 48 transformed individuals were sterile. This indicates that the 9.7 kb insert fragment does not contain 
at least the full-length Rf-1 gene. 

Example 5: Complementation assay for a 14.7 kb fragment from XSE7 
(Materials and Methods) 

[0231] The X phage clone XSE7 (Figs. 1 and 5) was completely digested with EcoRI and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then blunted by DNA Blunting Kit (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel to separate a 14.7 kb fragment (includinq bases 
261 8-1 7261 of SEQ ID NO: 27), which was purified by QIAEXII (QIAGEN). 

[0232] On the other hand, the plasmid vector pSB200 was completely digested with Sac! and then DNA was recov- 
ered by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP 
(TAKARA) and DNA was recovered by ethanol precipitation. The recovered DNA was dissolved in TE solution and 
then blunted by DNA Blunting Kit (TAKARA). The reaction solution was electrophoresed on an agarose gel and then 
a vector fragment was purified from the gel using QIAEXII (QIAGEN). 

[0233] The two fragments prepared above, i.e. the 14.7 kb fragment from XSE7 and the vector fragment were sub- 
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jected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

[0234] All of the 48 transformed individuals were sterile. This indicates that the 14.7 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 



Example 6: Complementation assay for a 21 .3 kb fragment from XSF4 
(Materials and Methods) 



[0235] The X phage clone XSF4 (Figs. 1 and 5) was partially digested with Notl and electrophoresed on an agarose 
gel. The separated 21 .3 kb fragment (including bases 1 2478-33750 of SEQ ID NO: 27) was purified by QIAEXII (QIA- 
GEN). 

[0236] On the other hand, the plasmid vector pSB200 was completely digested with Notl and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0237] The two fragments prepared above, i.e. the 21 .3 kb fragment from XSF4 and the vector fragment were sub- 
jected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

[0238] All of the 48 transformed individuals were sterile. This indicates that the 21.3 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 



Example 7: Complementation assay for a 13.2 kb fragment from XSF20 
(Materials and Methods) 



[0239] The X phage clone XSF20 (Figs. 1 and 5) was completely digested with Sail and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then blunted by DNA Blunting Kit (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel to separate a 13.2 kb fragment (including bases 
26809-40055 of SEQ ID NO: 27), which was purified by QIAEXII (QIAGEN). 

[0240] On the other hand, the plasmid vector pSB200 was completely digested with EcoRV and then DNA was 
recovered by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by 
CIAP (TAKARA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified 
from the gel using QIAEXII (QIAGEN). 

[0241] The two fragments prepared above, i.e. the 13.2 kb fragment from XSF20 and the vector fragment were 
subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were pre- 
pared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0242] All of the 44 transformed individuals were sterile. This indicates that the 13.2 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 



Example 8: Complementation assay for a 16.2 kb fragment from XSF18 
(Materials and Methods) 

[0243] The X phage clone XSF1 8 is identical to XSF20 at the 5' and 3' ends (bases 20328 and 41 921 of SEQ ID NO- 
27, respectively), but lacks internal bases 33947-38591 . Thus, it comprises bases 20328-33946 and 38592-41921 of 
SEQ ID NO: 27. This is because clone XSF18 was initially isolated but found to contain the above deletion during 
amplification after isolation, and therefore, the amplification step was freshly taken to isolate a complete clone desia- 
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[0244] The X phage clone XSF1 8 (Fig. 5) was completely digested with Notl and electrophoresed on an agarose gel 
The separated 16.2 kb fragment (including bases 21065-33946 and 38592-41921 of SEQ ID NO- 27) was Durified bv 
QIAEXII (QIAGEN). y 

[0245] On the other hand, the plasmid vector pSB200 was completely digested with Notl and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphoiylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0246] The two fragments prepared above, i.e. the 16.2 kb fragment from XSF18 and the vector fragment were 
subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were pre- 
pared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0247] All of the 48 transformed individuals were sterile (Fig. 6). This indicates that the 1 6.2 kb insert fragment does 
not contain at least the full-length Rf-1 gene. 

Example 9: Complementation assay for a 12.6 kb fragment from XSG22 
(Materials and Methods) 

[0248] The X phage clone XSG22 (Figs. 1 and 5) was partially digested with Notl and electrophoresed on an agarose 
gel. The separated 12.6 kb fragment (including bases 31684-44109 of SEQ ID NO: 27) was purified by QIAEXII (QIA- 
GEN). 

[0249] On the other hand , the plasmid vector pSB200 was completely digested with Notl and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0250] The two fragments prepared above, i.e. the 12.6 kb fragment from XSG22 and the vector fragment were 
subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were pre- 
pared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0251] All of the 48 transformed individuals were sterile. This indicates that the 12.6 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 10: (1) Complementation assay for a 15.7 kb fragment from XSG16 
(Materials and Methods) 

[0252] The X phage clone XSG1 6 (Figs. 1 and 5) was partially digested with Notl and electrophoresed on an agarose 
gel. The separated 1 5.7 kb fragment (including bases 38538-54123 of SEQ ID NO: 27) was purified by QIAEXII (QIA- 
GEN). 

[0253] On the other hand, the plasmid vector pSB200 was completely digested with Notl and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0254] The two fragments prepared above, i.e. the 15.7 kb fragment from XSG16 and the vector fragment were 
subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed plants were pre- 
pared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0255] Of the 47 transformed individuals, at least 37 individuals clearly restored fertility (Fig. 6). This indicates that 
15586 bases (bases 38538-54123 of SEQ ID NO: 27) derived from rice (IR24) in the 15.7 kb insert fragment include 
the full-length Rf-1 gene. 
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(2) Complementation assay for an internal 11 .4 kb fragment in XSG16 
(Materials and Methods) 

[0256] The X phage clone XSG16 was completely digested with AlwNI and BsiWI and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in TE solution and then blunted by DNA Blunting Kit (TAKARA) 
The reaction solution was electrophoresed on an agarose gel to separate a 11.4 kb fragment, which was purified by 
QIAEXII (QIAGEN). 

[0257] The plasmid vector pSB1 1 (Komari et al. Plant Journal, 1 996) was completely digested with Smal and then 
DNA was recovered by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephospho- 
rylated by CIAP (TAKARA). The reaction solution was electrophoresed on an agarose gel, and then a vector fraqment 
was purified from the gel using QIAEXII (QIAGEN). 

[0258] The two fragments prepared above were subjected to a ligation reaction using DNA Ligation Kit Ver 1 (TAKA- 
RA). After the reaction. DNA was recovered by ethanol precipitation. The recovered DNA was dissolved in pure water 
(prepared by a Millipore system) and then mixed with E. coli DH5a cells, and the mixture was electroporated After 
electroporation, the solution was cultured with shaking in LB medium (37°C, 1 hr) and then plated on an LB plate 
containing spectinomycin and warmed <37°C, 16 hr). Plasmids were isolated from 14 of the resulting colonies and 
their restriction enzyme fragment length patterns and boundary base sequences were analyzed to select desired E 
coli cells. 

[0259] The E. coli cells selected above were used for triparental mating with the Agrobacterium tumefaciens strain 
LBA4404/pSB4U (Takakura et al., Japanese Patent Application No. 2001-269982 (WO02/019803 A1)) and the helper 
E. col. strain HB1 01/pRK201 3 (Ditta et al. , 1 980) according to the method of Ditta et al. (1980). Plasmids were isolated 
from 12 of the colonies formed on an AB plate containing spectinomycin and their restriction enzyme fragment lenqth 
patterns were analyzed to select desired Agrobacterium cells. 

[0260] The Agrobacterium cells selected above were used to transform MS Koshihikari (having BT cytoplasm and 
a nucleus gene substantially identical to Koshihikari) according to the method of Hiei et al. (1 994). Necessary immature 
seeds of MS Koshihikari for transformation can be prepared by pollinating MS Koshihikari with Koshihikari 
[0261] Transformed plants were transferred to a greenhouse under long-day conditions after acclimation 1 20 indi- 
viduals grown to a stage suitable for transplantation were transplanted into 1/5000a Wagner pots (4 individuals/pot) 
and transferred into a greenhouse under short-day conditions about one month after transplantation. About one month 
after heading, one typical ear was sampled from each plant to evaluate seed fertility (the percentage of fertile paddies 
to total paddies). ' ' r 

(Results and Discussion) 

[0262] Of the 120 transformed individuals, 59 individuals showed seed fertility of 10% or more among which 19 
individuals showed seed fertility of 70% or more. This indicates that the 11 .4 kb insert fragment (bases 42357-53743 
of SEQ ID NO: 27) contains an essential Rf-1 gene region for expressing a fertility restoring function. 

(3) Complementation assay for an internal 6.8 kb fragment in XSG16 

(Materials and Methods) 

[0263] The X phage clone XSG1 6 was completely digested with Hpal and AlwNI and electrophoresed on an agarose 
gel. The separated 6.8 kb fragment was purified by QIAEXII (QIAGEN). 

[0264] The subsequent procedures including the preparation of the plasmid vector pSB1 1 were performed accordina 
to the method in (2) above. a 

(Results and Discussion) 

[0265] Of the 120 transformed individuals, 67 individuals showed seed fertility of 10% or more among which 26 
individuals showed seed fertility of 70% or more. This indicates that the 6.8 kb insert fragment (bases 42132-48883 of 
SEQ ID NO: 27) contains an essential Rf-1 gene region for expressing a fertility restoring function 
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Example 1 1 : Complementation assay for a 1 6,9 kb fragment from XSG8 



(Materials and Methods) 

[0266] The X phage clone XSG8 (Figs. 1 and 5) was completely digested with Notl and electrophoresed on an agarose 
gel. The separated 1 6.9 kb fragment (including bases 46558-63364 of SEQ ID NO: 27) was purified by QIAEXII (QIA- 
GEN). 

[0267] 0 n the other hand , the ptasmid vector pSB200 was completely digested with Notl and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0268] The two fragments prepared above, i.e. the 16.9 kb fragment from XSG8 and the vector fragment were sub- 
jected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed individuals were pre- 
pared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0269] All of the 48 transformed individuals were sterile. This indicates that the 16.9 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 12: Complementation assay for a 20.0 kb fragment from XSH1 8 
(Materials and Methods) 

[0270] The X phage clone XSH18 (Figs. 1 and 5) was completely digested with Notl and electrophoresed on an 
agarose gel. The separated 20.0 kb fragment (including bases 56409-76363 of SEQ ID NO : 27) was purified by QIAEXII 
(QIAGEN). 

[0271 ] On the other hand, the plasmid vector pSB200 was completely digested with Notl andthen DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKA- 
RA). The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the 
gel using QIAEXII (QIAGEN). 

[0272] The two fragments prepared above, i.e. the 20.0 kb fragment from XSH18 and the vector fragment were 
subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). Subsequently, transformed individuals were 
prepared and studied according to the method described in Example 4. 

(Results and Discussion) 

[0273] All of the 44 transformed individuals were sterile. This indicates that the 20.0 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 13: Complementation assay for a 19.7 kb fragment from an overlapping region of XSG8 and XSH18 
(Materials and Methods) 

[0274] A plasmid (XSG8SB200F) isolated from desired E. coli cells obtained by ligation in Example 1 1 was completely 
digested with Sail and Stul and electrophoresed on an agarose gel. The separated 12.8 kb fragment (including bases 
50430-63197 of SEQ ID NO: 27) was purified by QIAEXII (QIAGEN). 

[0275] On the other hand, a plasmid (XSH18SB200R) isolated from desired E. coli cells obtained by ligation in Ex- 
ample 12 was completely digested with Sail, Stul and Xhol and electrophoresed on an agarose gel to separate a 6.9 
kb fragment (including bases 63194-70116 of SEQ ID NO: 27), which was purified by QIAEXII (QIAGEN). 
[0276] Further, the plasmid vector pSB200 was completely digested with EcoRV and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in TE solution and then dephosphorylated by CIAP (TAKARA). 
The reaction solution was electrophoresed on an agarose gel, and then a vector fragment was purified from the qel 
using QIAEXII (QIAGEN). 

[0277] The three fragments prepared above, i.e. the 12.8 kb fragment from XSG8, the 6.9 kb fragment from XSH18 
and the vector fragment were subjected to a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). The ligation 
product contains a 1 9.7 kb fragment from an overlapping region of XSG8 and XSH18 (including 50430-70116 of SEQ 
ID NO: 27) (XSX1 in Fig. 5). Subsequently, transformed individuals were prepared and studied according to the method 
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described in Example 4. 
(Results and Discussion) 

5 [0278] All of the 40 transformed individuals were sterile. This indicates that the 19.7 kb insert fraqment does not 
contain at least the full-length Rf-1 gene. 
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SEQUENCE LISTING 
<H0> JAPAN TOBACCO INC. 
Syngenta Limi ted 

<120> A method for providing and inhibiting the rice fertility, and 
discerning the presence of the rice restorer gene by using the rice 
restorer gene to the rice BT type cytoplasmic male sterility. 



<130> 


t f f\ rrs n t\ A 

YCT739 




<150> 


JP 2001 


-285247 


<l5l> 


2001-09 


-19 


<150> 


JP 2001 


-309135 


<l5l> 


2001-10 


-04 


<150> 


JP 2002 


-185709 


<l5l> 


2OO2-06- 


-26 


<160> 


68 





<210> 1 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of R1877 EcoRI marker 

sequence. 

<400> 1 

cattcctgct tccatggaaa cgtc 24 

<210> 2 
<211> 33 
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<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of Rl 877 EcoRI marker 

sequence. 

<400> 2 

ctcttlctgt atacttgagc tttgacatct gac 33 

<2lO> 3 
<2ll> 20 
<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of G4003 Hind 1 1 1 marker 

sequence. 

<400> 3 

gatcgacgag tacctgaacg 20 

<2l0> 4 
<211> 24 
<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of G4003 Hind 1 1 1 marker 

sequence. 

<400> 4 

aatagttgga ttgtcctcaa aggg 24 



<2l0> 5 
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<211> 27 
<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for 

sequence. 

<400> 5 

aaagcaaccg acttcagtgg catcacc 27 

<2lO> 6 
<211> 24 
<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for 

sequence. 

<400> 6 

ctggacttca tttccctgca gage 24 

<2lO> 7 
<2ll> 27 
<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for 

sequence. 

<400> 7 

gaccaccaat taactgatta agctggc 27 



amplification of C1361 Mwol marker 



amplification of C1361 Mwol marker 



amplification of G2155 Mwol marker 
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<210> 8 
<211> 27 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of G2155 Mwol marker 

sequence. 

<400> 8 

tttctggctc caataatcag ctgtagc 27 

<2l0> 9 
<2U> 27 
<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of G291 Mspl marker 

sequence. <400> 9 

ctgctgcagc aagctgcacc gaaccgg 27 

<2lO> 10 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of G291 Mspl marker 
sequence. <400> 10 
acaltltttc ttccgaaact tccg 24 



<210> 11 
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<2I1> 24 
<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of R2303 BslI marker 

sequence. 

<400> 11 

alggaaagat acactagaat gage 24 

<210> 12 
<211> 24 
<212> DNA 

<2 1 3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of R2303 BslI marker 

sequence. 

<400> 12 

alcttatata gtggcaggaa agee 24 

<2 1 0> 13 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of S 1 00 1 9 BstUI marker 

sequence. 

<400> 13 

aacaatctta tcctgcacag actg 24 
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<210> 14 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of S 1 001 9 BstUl marker 

sequence. 

<400> 14 

gtcacataga agcagatggg ttcc 24 

<210> 15 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of SI 0602 Kpnl marker 

sequence. 

<400> 15 

agctgttgag agttctatgc cacc 24 

<210> 16 
<211> 24 
<2I2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of S10602 Kpnl marker 

sequence. 

<400> 16 

tagccatgca acaagatgtc atac 24 
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<210> 17 
<211> 26 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of S12564 Tsp509I marker 

sequence. 

<400> 17 

ctagttagac cgaataactg aggttc 26 

<210> 18 
<21l> 27 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification of S12564 Tsp509I marker 

sequence. 

<400> 18 

tttgtgggtt tgtggcattg agaaaat 27 

<210> 19 
<211> 2240 
<212> DNA 

<213> Oryza saliva L. 

<223> PCR marker G4003 Hindi 1 1 

<400> 19 

gcggccgctc cgggaagtcg agcgagtaga cgcccctgac gccgtacgcg tcggcgagcc 60 
gcagcggcgt ctctggcggt gtgaaggaca gcccgttcag cgtcgcgcgg cgccgcccgt 120 
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Igatcgtcac cggcgccgtg ctccgcagca ggtacgcctg cgtcacgttg atcgacgagt 180 
acctgaacga Iccctgtggg ttcggcctcg ccgctccggc actcaggttc cacctgccca 240 
algcaaaaaa ccaaaaccca aaagcttaat gcgaataata catcattcca cgtatttaaa 300 
aaaataattt aUggtaaaa tttttataat gUUttagc gacgtaaatg tcaatgctga 360 
gaaataaacg ataalacttt aaatgaagtt ctaaaaltta aattttggca tcggttgalg 420 
tiggataaag aaaacgatgg aggctagtaa tttltcttct tttttaagta tctagattgt 480 
catatattga atttttcagt ttttcatccc tttgaggaca atccaactat tattttcctt 540 
ttcttatgta aaaggttgaa caacatattc aaacataaaa aaataaaatt aaatgaaata 600 
aalttacaat tcataaaatt tacagaattt atgttaagaa aataUcaaa cttagataat 660 
aalaaagcaa caaaatcgta ctaaaaagaa gtataattgt acattgtata ctactactcc 720 
tacaatttta gacttagaat ttttaatttc ctgaaatcta gtaatgccat ttttttcttt 780 
ctagttgaac cagacagtaa gttlaactcg aaacttataa gctaatgagc gaagtcgggc 840 
aaltcactcg tacctgacgg agcgagcttg gttcatggag aaggacttgt cgaactggtc 900 
ctggggaggg tcggggagcg ggccggaggc ccgcccccgg gagttggagt agcggaggac 960 
ggcgacgccg gcgacgcggc gccacacggt gtcgttcacc atgcgcgcgc tggcgacgac 1020 
gtagtagtcg gagctcgcgt tctggtcggt ggtgacgagg aaggagtagg actggccgac 1080 
gtggacgtcc aggtlggtgt agttctgctg cgtcgtgtag gagccctccg tctccaccag 1140 
caccatgttg tgcccctgga tcctgaagtt gaggctcgtc gacgtcccca cgttgtgcac 1200 
tcggatcctg tacgtcttgc ctglgtcccc acaccgacgt cgccgacaca cgcgcaaaag 1260 
ataatagact cattgtaagt aggtagtaac cttctccgtt tcatattata aatcgtttga 1320 
ttatattttt gttagttaaa cttctttaag ttttttltct ataaacttaa ttaaatctaa 1380 
agaattttaa laaaaaaaat caaacgactt ataatataaa atggatggag tagttgcatc 1440 
aatttgtgga tgaagcaaac aagattatat ccttttcatg agggtgaaag tattcagtga 1500 
acaatlcgtc agtttcaagt ttcalgaaat cggacagggt ctclgaaagl ctgtattm 1560 
ggtactgttg gattgactac tctggcttct gttgtcacat cttttgtatc ctagtttcgg 1620 
taaaaaaaat tttggcattt ttactcctat cgttgatctg tttaactgaa accaltgcat 1680 
gatatactac tagcagacaa aaclggtgaa aatlcacgag aatgaacttt ttgtcagtta 1740 
agcattagcg gacagcttca gtaagcagag caggctgcct taaggcttaa agcactatct 1800 
tccacaacac tltgtcctac aatcaaaitc caaalttact atcacaaaaa gcgaaggaac 1860 
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taactaaacc ttactcctac tagtactact gctatgacta tgaaacaaga ttccaatcca 1920 

aagaaaacac agtgctcgat cagcatgata aaagcaacga aacctgctca tccagctgcc 1980 

aaaatgccac cccactgact ctacgtacgt actacgtatt gacgctgtaa aaaactagcc 2040 

gtagtacaga gaagaggacc caaagtttcg Icaaaaattt tattttaccc ggatccacat 2100 

tgatggtctc gtactcgatg ccggccggga caaggctgtc gttgtacctg tacgggccct 2160 

tgccgttaat cagcacgccg tccggcatcc cgaggtcctt gccactgtcc agcatctlcc 2220 

tcagatcctg caacgaattc 2240 

<2I0> 20 
<2I1> 2601 
<212> DNA 

<213> Oryza sativa L. 
<223> PCR marker C1361 Mwol 
<400> 20 

tcttgclgag atccaagtlg cggtaacttt gcccttttct ttttttcttc tcttctgaat 60 

tttttcatgg tttttgggag agatttlcgt aacttgatta cagltctagg aaaaggccac 120 

cttgttcaaa cagggcltlc ttgaaaggga tcaatttgct aggagtacat gattctaaaa 180 

gcgatttcga aataaaacac agttctcgat ctcatacctg aaaacaaaag gcccatactg 240 

tgtaaactgt gattatgctt ctgttaaatg ggatatttgt acaaaattga cgccaaccac 300 

ctalaaacag attgtgagct tttatcttag taaaataaaa tgtgacattc taclcagtgt 360 

tcagtgatcc gatgtcgtct cttctgcgta caacttctaa cagccgtttt cgglagtaca 420 

aactagcgaa acaccaaaaa cgcagcattt gagttctgga atacgctgaa attgttagaa 480 

tcaaccacga aaccaaaatc attgttcaga aacgttgcaa cgagataaaa cacaagaact 540 

tgttttaaca aagcatacgg acagtacata lacggttaca acacccagtc tttatacagt 600 

tctgctggag ttccatctac tggctgtcat tgtatctcag gacagacagg ttaacatagg 660 

tacaacacaa ttacaggcta aaccgaagcg aactacactg tcagcatctc taacagtatc 720 

glcaagcaag cttatltaca gctgctctag taaatttaca acgtccctgg cagaatccct 780 

clcgtttctg gcagcgacga ggcacggtcc atggccttag caggacatct cacccgtcag 840 

clgcatagaa agcaaccgac tlcagtggaa tcacctcctg ctcctgcaaa aaagttggtt 900 
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cgatcaatca cgcgtttaat ccaaaacaaa 
tacctcagag ttctctattt gctctgcagg 
cacctcaggg ctcttcatcc atgctttgtg 
catctgcgat acggalctaa aattaaggat 
gaagcaggtc cagttaaaga aaagtagcag 
agataaagaa atggttggtt tttccagttc 
gcattctggg caaaggatgg atgttatttt 
attgct tttt cccttgcttg caattttgtc 
ccagtgcaca aggcaagaag tgtgaaacaa 
gtcattggag gttgcagcag aatactacaa 
tgttttgctt gaaatgaatt caaccacttt 
gagaactgta attccattac aacagtgaag 
tcaaatgcct aatttlttag gtttgcttta 
gtcatgcgtt gctatcttca agaaatacta 
laacttaccc cgcttgagtt tclactgctg 
aaggtagcta catccltttc aagaagcatt 
gcagcaaggc gcttctgctt acgcaactcc 
acttgaagat tgtgtgaagt acctgtcggg 
tgggcggagc tgctttgctc attcgactcc 
ttctttcttc tctttccacg gtgtccagca 
ataggaaggt cctgatcatc agaagaccac 
ttgacaatgg caaagtcaga ttgctcatag 
tcctgtccta cccaagcaca ccagatcttg 
cataacctgt atgcaatatt tccatatccc 
tcctttagca atacactagg aataacgaga 
acatgatctt cagatacaga agcagttcta 
gtgccttttg cgccattgcg agagctagaa 
tcgttcgctg gaaccagagc cagctctctg 
gggtccttct cgcgcgagtt g 



atgggtatta attatgctag cctatgaagc 960 
gaaatgaagt ccagtggaac agttctcaag 1020 
tgcttcaatg gctttcagct talagcgaaa 1080 
gtcgacaatt acttaacaca acaaataatt 1140 
cgaagaalag cactctgaag lctgaacctc 1200 
atctccctca acatggattc cagtaccctg 1260 
cttaggtgca ttttttgcct ttcttcctcg 1320 
tgctagcatc tcatattggc ataaaatagt 1380 
algaaalgcc tgcaaaatta gccgtacaaa 1440 
atttttaaag aagaaactat acactgtcta 1500 
gcattatacg gtttggaatc cctggtttgt 1560 
aagttaccat aactaatgaa tggaaattag 1620 
atttatttat ctgtgagaaa tgctaagcat 1680 
agaaaclgca aaggcaaaga atgtttgaaa 1740 
caggctagat ttcctgtctt gcagttgagc 1800 
ggtcgcccac aaatatcaca agctttctca 1860 
ctcctcatag atttggtgga taagaggcca 1920 
gaacctgtta tgatagcttg gclattgtca 1980 
tctgaagatg ctlcttgatc tgaaaatgac 2040 
tcatcaatca cgaagaaaga tccagcagag 2100 
ttcctgccca actcaattgt ataagagaag 2160 
gtgtcacact catccaagcc atgggagcca 2220 
ctaatctttt tacttccttt gctagcttcc 2280 
aaaagalgca caggcaaatc cgaaacaaca 2340 
ggaccgtcag ttccaclttg gtttgacagc 2400 
ccattaccat gcgcatttgc accacggcgt 2460 
tcatctctca acctcgaagt cacttcagtg 2520 
gtgttctgcg agctcgagtc cagcaagagc 2580 

2601 
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<210> 21 
<211> 1333 
<212> DNA 

<213> Oryza saliva L. 
<223> PCR marker G2155 Mwol 
<400> 21 

ccctctgctt gatccagtgt acatccatgg 
agtgtgagac tggaaaaaaa talctgacgg 
gaaagttcag ttaactgtca acggctgtag 
atgaagcttt tactgtgcgt ggitaccagg 
gtaaaaaaat gtagtactat atgttaagac 
tttacatctc ccaaatgtta cctcagttgc 
tgatgtcggc agatcatgga ccatgtctca 
aatccaacca gatgagctgt gcaactgata 
ttcatgtaga tggaacttca atcccgaaga 
agaaaggatg gcggcaaaat ggcagcgata 
tttgtgcagc agttgagaca gtagcaagat 
ggatggaaga gtagtagtat gagatagagc 
agtaaaaggc attctcttct cttgtttgga 
cttggctgga atgtttggtt ggatcatgcg 
cctcgcttca tctctctcaa taattcaaag 
cgattcccgt taatgcaaat gcattatatc 
cagcaagcca gcaagtggtg tgaattgttt 
cacaaattcg cattgacttc tttcttagca 
tttgaacagg atgtctttcc aaggaaggtg 
1 1 tec 1 Cage atgaagagaa (gtgatcaac 
tttttgatac cctaaacagg agcacatcac 
tgattaagtt ggcatttcag atgcatccgt 



gttaggacag attagttact cagttaatta 60 
cagttltala agttgagtga ttgaactagl 120 
atttgggatg gcagactgtt ctgagtcaaa 180 
tgcagtaaaa taatttcaga tetaatcgea 240 
gagattggtc ggtcaaaatc tatctggccc 300 
aggtggtaaa aaaaaatcac tcgtttcacg 360 
aatgctgaaa ctctgaacaa tcaacaaaaa 420 
attgatcatc acactatttg caactcatct 480 
aataatgaca geaaaatget gcgatcctga 540 
aaaaaaaaat ggttggttac tgaagaatta 600 
aagagctagc (aagctagct aggtagagtt 660 
atggagcgcg acaactcaag tggatgclaa 720 
atcagaaaag aaaagaaaag acttgagctg 780 
cgctclcctt agcttagctc gecaagaaat 840 
ccacgagctc tetgetcata tecagtgega 900 
cagttcgaaa tgttacaatt cttgcgtttg 960 
aatccclcgt gcatttcaac gaaattctct 1020 
caattagtaa gcagtgacaa ataaagaatt 1080 
agatttttta tgtggatagc aaggatcgee 1140 
tttacacctt gcttacgatt atggcctlaa 1200 
atgcatgtcg acctgagacc accaattaac 1260 
cagttacatg atcaggtgat cgatggatca 1320 
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actgtaggtt tea j 33 3 

<210> 22 
<2ll> 863 
<212> DNA 

<2l3> Oryza sativa L. 
<223> PCR marker G291 Mspl 
<400> 22 

cgaacaggat caaaagtaga cgacgagggc atttagaagg agaggaattg tatttgttcc 60 

eggtatttaa tttttaaatt tgtggtcgga agtttcggaa gaaaaaatgt gctcatgagt 120 

gattattggc tctgaacacc aacctctctt ttcgttgatt ccttctgagg tgttgggtgt 180 

tgggacacga tgctgccgcc gacacgacac cgggttccac aatacactaa tctactcgcg 240 

acaccttcat tgaactgeat ataattattt agaaagtcca ttaacacatc ttataaaacc 300 

ttgttgaatc atataatcat tctataaagt ctalttgaac atcttatgaa aaaataagat 360 

ctgacctagt cgttacactc tcttacatti tccattagcc taactaattc cgtgcaggaa 420 

acgcccaaaa ataatagtac caatagtcca ctaatcccgt gccagaggcc gecaatgatt 480 

agtgattaac ccaaaaaaca laatcatcat cacacgccgc taatgaccag ctctcgctta 540 

gctcatccca caggcggccc ccacacgcca ctcctgccat gtgggcccac ctttcacacc 600 

ccccaccaac cagaaaaaaa actcccccaa aaaaaaaact tttaatgett atctcgcggc 660 

agtataaaag gcgaccccac cacccacaca caatcacagt cagcgaccca acccaacccg 720 

agecgaggag tegagtegtg tgaaaattac gaaattgece ttcgactcca ccaccaccac 780 

ccaccggcga ggegaggaga ggagaaaaat tgggaggaaa aaaaaaggga aaaagaaaaa 840 

gggtggagga gatttttgeg aag g 63 

<210> 23 

<211> 1510 

<212> DNA 

<213> Oryza sativa L. 

<223> PCR marker R2303 BslI 
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<400> 23 

tgccatgaag acctatggaa agaatatctt 
tctgtaacat ttagggctag gtcgaaggaa 
tttttttttt ctgtataggg ggaaagaaaa 
cagctggaga acaactggcg acatcgccga 
ttaaccctgt itacatagtt cttgagtttt 
cagtatgaca tctcgtgcag acgaaaatga 
atggaatggt aagaacttga gatgtatctg 
caaaatgatc aacatatgtt tctgctgtgc 
aaatggtggg atgtctgaag ctgagtaccg 
ggtaccatag catgttctat gtactaataa 
tcctcactgc aagttttgct tgaattgttc 
gctcaatgag ccagcagacg aagaacatac 
aaggcaagcc ttctcagttt cacatgctta 
atactcataa tgtaactctc tgaacagaia 
ctgacaacgg attggaggta tcccttcaat 
cataagcctt ggcatgatca tgactaactc 
cggtaggttt gggccgggcc actcagcaac 
cagtcatacc aggcaaccat cactgcacat 
gcggtcactg ctcgtgatct atgggcggta 
gaacatcttc tcttcagaal tcagagttca 
ttcgttcgcg gctcagggac agatatcagc 
tgtcttgaca ccaaactagt cagcaaagaa 
atacagctaa attgatcagg attcaggaag 
ctggaaatgg gatgccattt ggttatgtat 
latgtaatca gccgtcaaat gctggcgagt 
gcaataaaaa 



cttctcactc tgtgaatggt gagtttactc 60 
calgaagcat tgctgattca ctccactgtg 120 
tcctgctaca tgggcaggcc gcatgggtaa 180 
caactggggc aggttctact catcctctct 240 
tcagtactga tcgtaattgc cctgttattt 300 
ccaalgggct gcctatgctg gacctggtgg 360 
ttcctaggtt gcttaaccat ttgagagctt 420 
aatatcagat cctgacatgc ttgaagtggg 480 
gtcacacttc agtatctggg cactagcaaa 540 
ttttgctgca atgttgaact tctttgcatt 600. 
aggctcctct tttgatcgga tgcgatgtgc 660 
tcagcaactc ggaggtgatc gctgtcaacc 720 
gatttagcca tacctcttgg atatttcacc 780 
gtctaggtgt ccaaggaaag aaagtacaat 840 
ggcttccaaa tttgcagttt clcattgtcc 900 
tgaagctgac aatactttgt gtaaatttgt 960 
aacaggaagg ctgtggtgct ctggaacagg 1020 
tggtcgaaca tcgggctcgc tggatcggtc 1080 
aagcctttgc tttcttcaga gctcaaagta 1140 
taacaaattt ctgtcaattg tgcagcactc 1200 
atcggtggcg cctcatgact gcaagatgta 1260 
aagcagcaca ggttagtacg tgtccggcga 1320 
aaggtttgca atttgcaagg attggtagag 1380 
gtagaaataa gctgtaagcc Igtaagcgta 1440 
gtatttctga agtttgcaac gaaagttgca 1500 

1510 



<210> 24 
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<211> 1016 










5 


<212> DNA 

<213> Oryza sativa L. 












<223> PCR 


marker BstUI 










10 


<400> 24 














tggggattct 


tttctttaag caatttaaca 


ttattgtcct 


aacaatatac 


acaatat tgg 


60 


15 


tttttctttc 


agtatcaaat aattctttta 


ct 1 1 Igaaaa 


cacat 1 tgea 


atetfft tspa 

u * o » o * * Sot* 


1 ?0 


aacacaat ta 


tatcttgcac ttccttttgg 


aaat t taatc 


at t tgaaaac 


tga 1 1 ccfr £ t 


1 Rfl 




ttcatggctg 


taatcttctc ttgcgaacat 


cgctct t tct 


t tgatggt tc 


t C t ff 1 1 era p"a 




20 


agaagagcaa 


ccaagtaaat tttcgaaatg 


tttttttgtt 


ctttctat tc 


acca t tecaer 


300 




gttgtcaaag 


ccatcgagaa ggccataccg 


at tccgagag 


cgcaacccat 


tgee t teca t 


360 

u U v 




ggcccagcaa 


gggaagagct gaaggccatg 


gaggcgcaga 


aggtcgagat 


cgaccffcacc 


420 


25 


gcggcgctcc 


aggtgcgccg tgagctttgg 


c t ggggc t gg 


cat acc tegt 


eg t ccacar t 

v»(j l vvUgUv » 


4&0 




gccggcttca 


tgaggctcac attctgggag 


c tc teat err 


atgtcalffga 


arc p a f p t err 






t tctatgtga 


cctccatgta cttcatggcc 


ggctacacc t 


tct tec tcce 


cap p a a eta a or 


finn 


30 


gagccctcct 


tcgagggc 1 1 ct tcgagagc 


cggt tcgcgg 


CEaaecaeaa 


ffPffa t t £ a t or 


RfiO 

0 ou 




cacgcccggg 


atttcgatct ccgccggtat 


gacgagc tec 


2£C£a£CC t f? 


t PCrp r t CP P or 


790 




gtggt tcgga 


ctccgacgag cccctgcaga 


ccgtcgtcgt 


CRt Cff t C£ 1 C 


fir t r t t P P/a p or 


( OU 


35 


caggagagcc 


attgccattc ttactgccat 


t &cc a a t era i 


/» | f f or f flr/» f rr 
t I I I g I gC I g 


llClgtlClg 


840 




t tgtcagaat 


tttttcatgc ccagtttatg 


ggggt taagc 


tagct tctcc 


at tgtaccgt 


900 


40 


tctgatgtgc 


ggatgatgcg atgcaaagca 


tagtttgttg 


aagagatgac 


aaggcagat t 


960 




ttagct tgaa 


aacctggagg tgagaaaaaa 


aaatcctgat 


gtgtttgtgt 


gtgtga 


1016 



45 <210> 25 

<211> 676 

<212> DNA 

50 <213> Oryza sativa L. 

<223> PCR marker S10602 Kpnl 

<400> 25 



55 
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accaccttca tatgaagaaa ttaacggtgt ttlcatgagg aatccaacag tcgctgaatt 60 

' ggtggaaact gtggaattct tcttggctga gglaaccaat catcactica ccacaatgca 120 

caagtttgta gcttactact acagtacttc taataagttt tgtctgttga gattttattg 180 

ctgatttcta tgcatggtca tctttttgac aggccatcca gtcltatcgt gctgagagtg 240 

aaactgagct caacctggca gctggtgact atatagltgt ccggaaggta cggccctatc 300 

tlcccattgg acatgtltct aaccataaac atatctttgc tggacttttg tgggcaaagt 360 

(ggciacact aaacttgtgt tcattaacct gclcaatcag gtgtcaaaca atggatgggc 420 

agaaggtgaa tgcagaggga aagctggctg gtlcccttac gactacatcg agaaaaggga 480 

ccgtgtgctt gcaagtaaag tcgcccaggt cttctaggcg ttcaatgagc catacataca 540 

taaccctggl gttgiacact gtattatgat cgltcgtgat cttcaaagac cctctgatca 600 

gagaaatcac aaatattctt Itgttctatt attgtcatta tcactacccc ttttgtcaaa 660 

accagtgcag cctttt 676 

<210> 26 
<211> 1059 
<212> DNA 

<213> Oryza sativa L. 
<223> PCR marker Tsp509I 
<400> 26 

gcgagatcat gaactlgatt ttctggttgc calattgggc ttgcltglta accltgtaga 60 

gaaggatagc cttaataggl aagtccctca catgcttcct tccatttgct caattcatat 120 

cagtgttact gttctggcag ttccttgggg tcaggaclca gaaacatcca attaatgttc 180 

atgttctctt aacgactcag aaatacttta taacctctcc acaggglacg gclttcatct 240 

gcccgtgttc ctgttgatcl atctcagaat ccacagagtg aagagacaca gagagatgtc 300 

atagcactcc tctgttctgl attcttagca aglcaaggtg ctagtgaagc ttctggaact 360 

alatcaccgg taattcaaaa ttcttcaagt tccttttgla tgtagattat atctttgtaa 420 

aactcggcat ttattacctg ctctttgttt caaaaagcag tattltattt tgctccttag 480 

cataggtcag cagaacagtt gatcttattc agaaaacaat altttgcatg taacatactg 540 

Uatctatga gatgaaaaU aatgcatgtg taataatgtc aatgataaat atttgctatc 600 
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tgaalccagt ctaccaactc lagttagacc gaattactga ggttctattt caaagaataa 660 

tttagtgcac catitgttca actactatga agtaaaatgg tattcccttc tattgacatc 720 

gggttagaag tgaaaggcca tcttaatgcg atgttctcaa tgccacaaac ccacaaattt 780 

cattaacaca tacagattat tattaacata gclataaatt ggatttccag aagcttgagt 840 

tgaatttatt ttgttacaat tgaaagcact gggaacatta gcattttttt ttagttcttg 900 

gttattgcaa tttataatgt tatacagaac tgtgtacctc acaatgcatt cattatgaca 960 

ttctatgaac catttgattg actgttgctt gtaaacaaca ggatgatgag gagtctttga 1020 

tgcaaggagc acgggaagct gaaatgatga tcgtagagg 1059 



<210> 27 

<211> 76363 

<212> DNA 

<213> Orza sativa IR24 

<400> 27 



gatcaactaa 


caacctct 1 1 


gcagcaaaaa 


gggctctgca 


gatggactga 


tactctgacc 


cattcttttt 


ttttttatgt 


tt tcccctgt 


ct tctcalct 


agcgatgt ta 


tttttcttag 


tctaal taag 


tgtaaagtag 


caacagcata 


ttcagttatt 


cagagacgtc 


catacagt 1 1 


gaagct taga 


ataaatcaca 


tgttagcaat 


gtggccatca 


aat tctgtgt 


agaagtgtat 


cgctctgtt t 


tgtggaagi t 


aagaagtccc 


gagat tgatg 


caactctgat 


tgcagcat 1 1 


catgatgtt t 


at tgtttagt 


actacaagat 


attttataaa 


atct tggagt 


aacaagttca 


agtctatgta 


tat tgtctcc 


tttgttttta 


gtaattttct 


gaaggt t tea 


ccggtt tegg 


agacaaaacg 


gtccctccaa 


cacataaatg 



agcatacaca caagtgtttg tcttggcctg 60 
tgcagtgggc ttgggagcta acaatggttt 120 
tgtttttgct catgttttgt gtaatttttt 180 
catgatggga glagccctcc tttttttttc 240 
gggatgaatg ttcagtgtag tgtgtggtgt 300 
gtaccttgtg accacacgtc ttaatctgat 360 
gcaatatcat ctgcgtcttc tctcactttg 420 
ggttggtgtg ctgttgcaaa tgeegtatte 480 
tagttgaaat accgattttt catgatctcg 540 
ctttttatta gaatgtacac tecatgetat 600 
ttggttaacc attattttaa (atcataata 660 
taatacatga tagcataact ttttgaggct 720 
aactaagcac tcaalaaatt attgatggct 780 
cccgtgcttt ataaatagct teggcacaaa 840 
gtlgagttta cgttttcatt atctttggta 900 
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aaatcaagtc caccacgtag acactcataa 
cttgagtcta tcttaccttt gatatcggac 
tattattcal attacacctg aactttatat 
tttaattgca tacatgctga aatagtttga 
tcgtcttaag ttgcaattca ttttatccct 
atagtactaa ttcacaltac aatatagtat 
atattaaata ttcaaaacta gtcatttaag 
cacaalataa atggcattac aaaacttagg 
tgctagagga atttclatca aaagttiatg 
aagctattct tgtcgtaaac gttaaatatt 
aaaacltagt tttcaaaatg attggtctgt 
atctaacagt ttcclggcag aaactgtaat 
ttcaaatatg tgtacgtata tctgatgtga 
algaggcctt acagatatat ttgatgggtg 
tttttaaagf alacggacac acatttgaag 
tacatattct gcctgtaaac aacgagacaa 
caaacgttta ctgcagcatc acattglcaa 
glctcgtaat ttacatgcaa actgtgtaat 
catgcatgtc caaatatttg atgcgatctt 
ggatcaaatt tgctgaattt ttccagacgt 
gattcgccac cgacgccttg gtttccaacg 
gctctccatc gccatcggcg gccaacggcg 
actcgccgcc gccccctgat ccaaagcctc 
aggaggagga ggaggcgtga gcccctatgg 
acgccgccgg cgccggcgac gccacgccgt 
agaggccccc cccccccgcc gctcgctgat 
tgtgatttgg gtgttttltt tttttccgca 
ggcgtggagt gccagccgca tcgggtgcgc 
acggcgagct ggaggaggcg gagggagacc 



caaaagtttg aatatcctca gaaattttga 960 
atccaaccct ccctccctcc ctgaacttta 1 020 
tattcatatt acaccctgaa gtggttttca 1080 
caacgtgaga tgcactaaaa atctacacgt 1 l 40 
tttctttttc tctcttacat aggaatatca 1200 
aaattggtaa tcgattat tg gcaatatact 1260 
ctgccaaata agtaaaccac tatcgaaaac 1320 
gggttgaata tccaatttta aagttcatga 1380 
ggtacatatg gactttttcc tttttaaaag 1440 
ttttgtactt tattttttat gattgaaaaa 1500 
atacaagcat caattagact laataaattc 1560 
ttgtttttgt tattagacta cgtltattat 1620 
caaccaaacc caaaaatttt ccctaactcc 1680 
taaagttttt taagttcttt gggtgcaaag 1740 
Cat taaatat agacaaataa caaaacatat 1800 
atttattaag cctaattaat ctgtcattag 1860 
atcatagcgt aattaggctc aaaaatattc 1920 
tggttttttt ttcgtcaaca tttaalactc 1980 
tttggccaaa ttttgttgga atctaaacaa 2040 
cacggcltgt tcatccatcg ttcgcalcgc 2100 
aattttatca tccgcttaaa tacatccaaa 2160 
accgctccgc tctacccaat ccacccatcc 2220 
cgccgcgccg ccgtcgagag gaggaggagg 2280 
ggaccctcct ccggccgcgt ccgcttgccc 2340 
cgaccgcgca cggtagccac gcgcctctcg 2400 
ctctcttctc atcctgtttg ggtttgggtt 2460 
gcggtggtgg tgagcggtgg ccgcggccgt 2520 
cgccgcccgg gtccgcaggt tgcggtggcg 2580 
gtggtgagat cggatttcgc cgctggtggt 2640 
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gccgctacca tgggggattc gccgcaggcg 
tctcgcaaaa tgtgttgcta tgttcctctc 
Utgctggaa catlacattc ggaacgtlgl 
ggaggggaga aaaatcgttt gaacctgcag 
tgaaggtgtg gaaatcaaac alaatcattg 
catattgltg gcttataaca gttagctcca 
agtataaatt gaggataact ggcagttgtt 
aactagctat tgtgcattta Igtttcatgg 
agacglataa tgcaaaaaaa aaaaaaaaac 
gatgcaaata aaatcatgtt catggacaaa 
gcatttctta tgtggttact tttgtttgtt 
aagggtcaga ggggtttgct tctacgcggt 
cgccatggtt gtggttcatc cacttgtgct 
gcactacatt gcaaagattt gggcaactct 
cgagggaatg gagaacctgc caccgaatag 
gagtttcttg gatatctata cccttctaac 
gacaagtata tttatgttcc caattattgg 
tttgcggcgt atggacagca ggagccagct 
taagtagaca tatatacatt tacagtattt 
atgattttgg ggaaaacaca aaactctctt 
cacagttata gcaccttctt taaaatgaag 
acalaatgaa ttttgtttat ttctatcttt 
gatcgccttc ctatatattt gtagtatgtc 
tggtttgtta aaattcagga ctgtcttaaa 
tctgtatttt tctttccaga ggggactaga 
gttcagtaac caaacttagg ttacatiaca 
tgttaacctt ctcatggtgt actgacgtgg 
tgtggctaca aagaccgglg ctcctgtgat 
gatgccttct ggaatggaag gcatccttaa 



ctctcaggtt tgcagcctcc tccactctct 2700 
gctgggctgg cctcatagcc attaatgtag 2760 
tggcaattgc ttgacaaaat gtggaattgt 2820 
tgacaaaatt gccatctata attttaaaac 2880 
ccagcacatc attcttgtta accaccttga 2940 
caccaacttg gaaggtgtca atggaatgta 3000 
aagactttct acagaacttg tagcagctaa 3060 
aatttgagcg gcaatggata tttcttacta 3120 
tatgtctatg cagtttacat gtaatgtgcg 3180 
ctaatgggat tcataccaaa ttccagaatt 3240 
gatttggtta ccagacatcg atgtggtttc 3300 
gactgcagtt gcagcaatct ttttglttgl 3360 
cctatttgac cgataccgga ggagagctca 3420 
gacaatttcc atgttctaca agcttgacgt 3480 
tagccctgct gtctatgttg cgaaccatca 3540 
tctaggaagg tgtttcaagt ttataagcaa 3600 
atgggcaalg tatctcttag gagtaattcc 3660 
ggtatggclg tagtctcatc cctgctttct 3720 
ggtaaataaa caagatttta tgaatcatat 3780 
tgttggctgc cttgaacata gttctgttca 3840 
aactttgttg catacacata aggccaaacc 3900 
gaatgttagc atcgtttttg tttaatgcat 3960 
aacattgtat tccatgctga gcataacaaa 4020 
cggtgtgtgg atttggtgaa aaaaggagca 4080 
agcaaagalg gaaagctagg tgcatttaag 4140 
tctaatgaga tttltatalt cagtatataa 4200 
ttataaatgt ccccagagag gtgcattcag 4260 
acctattact cttctcggga cagggaaact 4320 
ttcaggttca gtaaagctca ttattcacca 4380 
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tccaattgaa gggaatgatg ctgagaaatt 
cactcttatt ctaaacggtt atggagtgca 
atggaaccta ttcaaaggca cagacaggct 
act agt tact aattactttc gtaatcagfa 
tacatttctg cacttggtaa atttacagaa 
tattcaccca gtcatatagt tgaagaggca 
cacctcttgt tcgattaaaa atgaccaaag 
ttagatatgg cccattaatt tgtttagttg 
agttaccatt tctatgatct aaaacaatca 
tcagtacaca tgtatgaatc ttaatattct 
accgtgtcag ctggttcaca ttagctagtt 
tttcatgcat atcatgctaa tttgcttgcc 
ataatttata tatgttttag acttctagtc 
tctagtatgg ctgatagcag actaggtgct 
agaagaaata caagactgtc cattgttagt 
ttacttttgc ccctatttta tttttaacaa 
tcttaattta ggagaagttg ctcgtttcat 
tcgagtctgt caatgcaatc ctgtgttctt 
attgaacact gaatggtaag actgcttctg 
acttgcagtt agtgctgcca cgccgattaa 
aaatgagtta tatttcattg gaaatcgaag 
ggaaacttgg taattactcc ataaagagag 
gctgtgagct ccgggctgta tgtattctgg 
cttaagttag cgatatcaaa tttggggaag 
ggtgcgtgac gttatctgct aggttcaaat 
ttcaaatcct agtgactatg aatattaatg 
gtttctttaa gattgtgcca tcggacgcca 
attcacttgt gttacatgca cgcactaaac 
tlctgggcta gaaactcaaa cgttgaattt 



atgttctgaa gcaaggaagg tgatagctga 4440 
ctaaagaaag atggtgtttt tttttattat 4500 
ttcaaggcta agcttgttac aggtactgat 4560 
taaataagct tgtgtagtgt aatggcattg 4620 
gaggcaagta atattttaga ggattgagtt 4680 
agtaacctgt aagagaggac tgaacattaa 4740 
agcatcaaac atgtattcga ggctgttact 4800 
tctatgtaca tcctagttgg tgtaaatgcc 4860 
actcttttag tatat 1 1 tea aaaacgaaat 4920 
tctctagctc gttacaaaag caacaaaggc 4980 
tgtacttagc attatccact agcaccttat 5040 
cacgttgagt gggaattttt ttcatgtttt 5100 
cacaatttat gtacttcatg ttcctgagcc 5160 
gaglgctglc etttmgea gactgaagag 5220 
cagatttgla aaaalagact ctgatgtagt 5280 
tacaaatata taacagatcc laagaactta 5340 
taaattaaat tgtgaagtaa aaatgtgtgc 5400 
gtttgaagat atggtglagg geaggecagg 5460 
ccttcagacg ttattgctaa atttttagct 5520 
gcagtagaac aaagtagttt tgtcgtgcac 5580 
cgaaaacgaa tcaaaagtta gaagaaaagg 5640 
tgcattttat tggtaagatg gtatccggaa 5700 
caaatttgat atgagatget cgattattgg 5760 
caccaaagga attattgtga aggagttatg 5820 
ccttgtggct atgaatattt atetgetagg 5880 
ggtaaggtaa gggatttatt gttaatttta 5940 
tteggtaact gtaataatgc tttgtattgg 6000 
atgtgcttta ccttttcatc tgtttttgcg 6060 
tccatggtct gctcaacttg acaattaclg 6120 
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cgtgtcaagc gatcttatac gcatactatg 
cagtataacg tgtgatattg atttttttaa 
ggaacaaaga ctttttttaa aagaagggta 
tatcagtgca catggcaagt gtgctcggca 
ttctatatgt tcttttttat aagggtggca 
ttggatttat taaaaactgt ataaattata 
tcttggagaa aatgtataag ttgcaaacaa 
gccggaaagg ccgaaggagg cctgacggag 
gccccccttt gccggttcgg ccggggactc 
aaggttcgcg accgtcgaaa gagcataaga 
gagaagcgta ataccctact cctgtgtttt 
tatgagccag cctctccctt gttctgggtt 
ccctctaagt gggcaaggtc ctccttttat 
cctcctttct gtggagactt accctacctt 
ccgtccgaat gaccttctga taggacggcc 
gtgcgacgtg ggattatggc tgtctgctga 
ttgctcattc clgtccacca cgcgtcagtt 
caacatcttg cctgtaatgg ttaggatgaa 
gccatctcta ggaggtaaca cgctagctcc 
cctgacggga tggggcgagg cgtgcgtcag 
accggtctgt gactggtcac agaccggata 
cgtgacacgc (cagccaaac cgcaataaat 
aacccalaca cgcggagcaa aaacccacga 
aggcgggtgc ggtccgaccc cctcgggggg 
ggcccgacgt cccccgaggg tgccaggcca 
agtcatgata ctcctgatcc catgtcaccg 
tcgccctctt taagggaagc ggtcgggcgt 
ggaccggtct ccacaattgg gcagaaaccc 
gtaactctac tatcaataat gagcggtctc 



cgcacaagtg attgtatacg gatatgatga 6180 
taaaaaaatg atgtlcattt ccttgatgaa 6240 
ttactaaaaa caaaaatgac aaaaacaaaa 6300 
attttttctc tgtactttaa acaaaaatac 6360 
caaatctttt aaatgagcca aatatctaca 6420 
atttatactc tgaaaggttg tgtgcalctc 6480 
acattaatcc acgttatgta actttttttc 6540 
cgtggggctc ctcaccggga gaccgcgcag 6600 
agggtgaaat tctaagctct ctgtatgtgg 6660 
cacgggcgat gtatacaggt tcgggccgct 6720 
gggggatctg tgtatgaagg agctacaaag 6780 
ccgaatctgg aaaagtccag tccagtcccc 6840 
atcttaaggg gataccacat gcaccatclc 6900 
ttcataaatg gacggagatt tgtatagttg 6960 
catacctacc tccacttccg ccgaaagcag 7020 
cgacatgacc agtgtcagac tggtcacaaa 7080 
tagcaatcta catgttggcc cttcttcaca 7140 
gcctggcata tatctaacca ggactaacgt 7200 
agctggggac gagcgcctag aagccctcgt 7260 
atcgcctgtc gccacctaac ctgcgatctg 7320 
aacgagtgca ctgcacttcg ttacatgcag 7380 
gtggttaggt gagccccgct gtgctcaccl 7440 
ggggtcgggg cgcctcggcc ctcggggccg 7500 
actaagagga gggcgaacac atcaccctcg 7560 
cgtgggcgat tgtgtctgcc tcaaacctct 7620 
acagtagccc ccggcgttat gccagggcga 7680 
gacgccactc ctaaggcctg gtgacaggtg 7740 
aacgglcaca aatcacgcac atcggcaatg 7800 
ttcaagactg ccacattact cgagtagcac 7860 
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acgaatctgg acatggcgat tcgtttcgtc 
gagcgtaatt aacgcgcgca cgatatgatc 
tcatgcgccg caagcgggcg aatgggatta 
cgaaa l&gtg ggcgcgagaa gcgaggagcc 
ctgaggaaag gatctgtttc cttcctttcg 
ccctaactcc ttctttcctg tgctctactt 
ttcctccggc gccatggcac ggggctccgc 
ccgcatcgtg agcgagaggc aggctgggct 
cggccgggag atagtcacgc tgggcgaggg 
cgtcttcttt ctcccctttg caatggcagg 
ggatgttctg aagttctacg atctccagat 
attggccatc ttcgcgcatc tgtgcgagat 
gttccggtgg ttcttcaccg tgcagtcggt 
citccagcca cgggggccgg tgctgaatcg 
ggacgactgg aagagcgact ggttctacac 
tccgagccag cccccggcgc aggcctccag 
ctatgacgcc gtcctcgacc gcctggcggg 
ggtgtacggc gactacctcc glcgtcggat 
clgggagtac accgggtccg aagactacat 
ggctcctgag gatttcaaga tagtggtcca 
gtccctcatt ccccaaggaa tcctccctct 
gaccattatg acggcggtcg gggcctcaga 
aggcgggagc cgtagggggg atcaatctac 
cgacggaggc ccgaggagca gccgccctgc 
aacacctccc ccatctcctc cccgaggggg 
ggagggggcc gcgccgacat cgcagcccga 
gatgggggag acagaaccat ctcagggaaa 
ccgaccccct cgcaggttcg tctctcaccc 
gttttcactc acccatcttg ttcgtcttct 



Iggagatatg gtaacglcgc tttggtcggc 7920 
tatctcgact gccacaaccg catatccacc 7980 
gtggaagcgt gggcgcgaga aacgaggggg 8040 
gggcacagcg ttggcaagag talaaaggca 81 00 
ccatcatttc ccttgtcttc gccgcttgcg 8160 
tcgccacacg cgctcgctct caatcttctc 8220 
tctgctcgat ggtagcgtgc tgccgccttc 8280 
gccgcgccgc ttcatgccgg aatctgccac 8340 
acgcccggcg ccagactacc cggggcggtc 8400 
gctggttccg ccattttctt ctttcttcat 8460 
ggcgcacctc acccccaacg cggtgatgac 8520 
gttcattggg gtgcgcccat ctcttcggct 8580 
gtcgccgcca tcggtagttg gtggctgcta 8640 
ctacatcccc tgcgccctcc gcaagaaglg 8700 
ccccctcgcc gacgaagcgc gcctccgact 8760 
ctggcgggcg ccggt agate tgggggatgg 8820 
cctacgatcc caggggctca cagggaccat 8880 
tgcgccgclc cagcggcgcg ctcggggcgc 8940 
gaggacccac cagggagtca gatgggactg 9000 
acgggtgctg aatctcaact ccatggaggc 9060 
ctgeagegat ccagaccgcg cctccatcct 9120 
ggagtgagct ccaaagggcc acgacggcgc 9180 
ccggggaggg ggtcgtgctt ctgggtctcg 9240 
cgacgcccgg gggaagagga agcagggagg 9300 
eggggeggtg cgtgccagca gcaggcgccc 9360 
gggggagege aagaagaagc ggctccgcaa 9420 
ccttatttcc cctctaaagt ggtcgtltaa 9480 
atcgtggctg tattcattct ctcaacgcga 9540 
ggtcttttct tctglttcag cgagatcccg 9600 
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tcgcgtccct cccgccattc caagtccggc 
gaggcccgga ggcgggaatc tgaccggcga 
gaggccgccc aggaggccgc ccgggctcgc 
gcccgggccc gccaggccga ggaagccgct 
gaggaagccg ctcgggagaa agccggattt 
gcagctcgcg atgaggtcgc gggcgcgtcg 
gcgacaactl ccggggcagc tggcgacgag 
tcaggcgacg cccaggacca accaggtccg 
ggcggcccga gccgcgtggc atcctctcca 
ccactgagcg cagagcccct tctgcaggcc 
gggcttagtg cccaggtgga ggccctgcaa 
gcgcatgtcg aggaggggcg gcgctcagtg 
caccgccggc atgtctcgga gcttgaagcc 
gaagtggagg aggagcgggg ggctgccctc 
gacaccclcc gcctlcaata cgggagctgg 
gcccaggggg tgcttgacgc tgccgctgcc 
gcgtcccgac ggcgcgaaga gacccttgag 
tgcgtcgtgg agagggatct ggcggaccgc 
ctggcggcgc acgagtccgc ctgtgccgaa 
gcgctcaccg agcgggagcg agctctcgag 
gacagcctgt ccctccgcga ggcagcgcag 
gtccgcgccg agaggaccgc actgaaccag 
gagctggacg cgagggcgcg cagcgacggg 
cgcctcgctg ctgccgaaca taccatcgcc 
ggggaggtcg aggccctccg cttggcaggc 
gtctcccgcc tagatcgcgc cggtcggcag 
tacgccgcca accatggagg cctcgcccag 
cggctccccg aggagctcga gaagacaatt 
gcggtggagc tcgtactggc gagttaccag 



cagtctgagg ccgaggatcc ggcggccgca 9660 
gaggccgcgg atcgcctacg ggaagccgag 9720 
caggtcgagg aaaccgctcg ggaggaggcc 9780 
cgggaggagg ccgcccgagc ccaccaggcc 9840 
cgccaggacg aggcaatggc gacttccgag 9900 
cttgagccca cttcctcggg cgacgctcag 9960 
gctgcgggcg cgtcgcttgg gcccactccc 10020 
agggacatcc ctgagtccgg cacttccatc 10080 
aggcggctct tccccacgcc ttctatcgcc 10140 
ttggccgccg caaacaccgc ggtgttggac 10200 
gcagagtggg cggagctcga cgccgcgtgg 10260 
gaggccatgg tggaggtggg ccgcaaggca 10320 
cglaagaagg tgttggcgga aatcgccaag 10380 
attgccacca gcgtgatgaa cgaggcgcag 10440 
gaggcggagc tagggaaaaa gctcgacacc 10500 
cgagaacagc gggcggggga gaccgaagcg 10560 
gcgcgcgcca tggcgctgga agagcgcgcc 10620 
gaggccgccg tcactatccg ggaggcaaca 10680 
gaggagtccg cactccgcct ccacgaggac 10740 
gaggccgagg ccgcggcgca acggctggcg 10800 
gaggagcagg cgcgccgcac tctggaatgt 10860 
caggccgctg acctcgaggc gcgggagaag 10920 
gcggctgcgg gcgaaaacga cttagccgcc 10980 
gatctgcagg gcgcgctaaa ctcgtccgcc 11040 
gaggtagggc ccggcatgct ttgggacgcc 11100 
gtgggcctct ggagagggcg gaccgtaaag 11160 
cgcctctcga agatggccag ggctctccaa 11220 
aagtcatcct cgagggacct cgcccaagga 11280 
gccagggacc ccaatttctc tccatggatg 11340 



EP 1 437 409 A1 



gcgctggatg agttccctcc tgggaccgag 
cgaccatatc gtccacagct tcgagggctc 
cgacgaggag gacaatgccg gtggtgcaga 
cgtatcggat tgatccccca agcccccgcc 
ctaaggcctt cgggcctctt ttttgtatag 
aatttttgtg tcaatttcat cttgctgtgt 
ccttttgcgt cttagcttga Uaagggctc 
Sgtcggggct agtgcctggg gagatccaca 
gaccgagggt tatgggtgac ccgattgtgg 
acgccccggg gcacggctcg gttctgggcc 
ccgagggcag gattgagcac gagtgaccta 
aaaacacaga tacagccttt aggaaattga 
gaaataagaa tgtgcatgtg tggcagcccc 
gggttggccc gagcccgaaa cctgacaccc 
ggtgttcgat gttccacggg ttaggcagct 
agcccggccg ggggacgccg accactcgat 
tcaatccagc acgcgtttgg acgcggcgta 
cccggacgtg acgctgatgg tagcgccgca 
ccgcgcgccg ccttcgctct tccaagtagt 
cagcctcgca gtacatggtg gcccgaggag 
cttccgcgcc gtagacgagg aagaaaggcg 
ggtttgccca gagcaccgct agcaactcct 
tgttgaaggt cttggtttta aggcctttga 
cattgcttct ggggtgggca ggtgaggcga 
agtcgccgaa gagttcacta gtgaattggg 
ctccaaaccg ggccgtgatg cccttaatga 
cgaccggata agcctcgggc cacttagtga 
acccgcccgg ggcccgccla aacggtccca 
acgaaagtgg tatggtctgc agggcclggg 



gacagcgcgc gcgcaggtcc gggatgccgc 1 1 400 
agcccctcgg ctcgcgttcg cccccaactc 1 1 460 
cgacagtgac gatgaggccg gcgacccggg 1 l 520 
attctttagt tttttcttct tttccttctt 11580 
atcaactlaa tctgfaatca aaaatgaaga 11640 
gtatgagatg aggatgatct gtgacgtggt 11700 
gtgcccaggl cccagtcctc aaaaggcgtg 11760 
tgtcgagact ggccaggccg ggaacgtggt 11820 
gtttttgccg attccccccc ggagttcacc 11880 
ccgtttggcg atttlagccg acccgagccc 11940 
tttcaagtca agattcttca aaaggaaaaa 12000 
aactgctttt attgaaatac tgaaataaga 12060 
cggccaacgc tgcacgcccg agggggtgcg 12120 
gacccccccc tcaggggtag aagcgacgaa 12180 
caatgccgtc gcccgtggcc agccgtatgg 12240 
acggaccctc ccacattggt gagagcttgc 12300 
ggacgaggtc gtcgacgcag agtgatcggg 12360 
ggctclgclg gtagcgcgcg gctctgaggg 12420 
cgaggtcatc tctgcgaagt tgatcttgat 12480 
acctcagggt gagctcggat gggagaaccg 12540 
tttccccggt tgctcggctt ggtgtagtlc 12600 
cgatccatga atcgtcglgc ttcttgagla 12660 
ggatttctga attggcgcgc tccacttggc 12720 
agcagagctt gatgcccatg tcttcgcagt 12780 
tgccattatc cgtaataata cggttaggca 12840 
atttaagtgc ggagtgctta tcgatcttga 12900 
acttgtcgat cgcgacatac agatactcaa 12960 
ggatatcgag cccctagaca gcaaalggcc 13020 
ccggctgatg gatttgcttg gcgtggaatt 13080 
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gacacgctct acatcgccgg accaggtcga 
aaccctggcg aaaagcttta ccaaccaagg 
cttcalggat atcggcaaga agcacaacgc 
ttccaltagc cgcgcgccga tagagggtcc 
gatggacgcg ttcactccct tcgcggtcct 
ggatctcggc aatccaagca atcaatctaa 
aggcctggac ttcgacgggc ctcgggggcc 
cgggtcgcgc cgacggctgg gcaagccttt 
gcgtggacgc gagccgtgag agttcgtcgg 
gaagctcaat cccgtcaaaa Iggcgctcca 
tctgcgggtc agagcaccgg tactccttac 
ctaacaccag gaggcggcgg atccccagtc 
cctcgtactc tgccatattg ttagtcgctc 
cgtctccgct cggagaggtc aacgtgaccc 
cgtcgaactg cattacccag tgggcggtgt 
cggggattga gacgggctcg ggagccgggg 
ggctcttgat agcgtgacgt ggttcaaagt 
atttcaccac ccgtcctgta ccctctcgat 
taaccacagt gacccgatgc gcctggaaat 
tagcgtaaag catcttctgg gcctgagggt 
taacaaagta gacgggccgc tgcacctttc 
catccctggg gcactcttcg tccaagcagc 
tgacctcggg gtcggaggat aacaggggcg 
gggggtcagg ggctcctggc gtcgtcggac 
ggggcct tag gcctccgtlc ggctcggggg 
tcagcacagg gttagcctcg gggtcaaagg 
cagggccttc ctgggggtcg ggggctccta 
ctccggtcgl cgggggcctc aggccaccgt 
tcccgggcca agtcggcaca gggtggggaa 



ccgcatcatt gagagctgtc ggccaataga 13140 
tgcgcgaggc ggaglgggct ccgcatlcgc 13200 
cttgttcccg aggaatgcac ttcaggagga 13260 
cllctaccag cacgtagcgt ttggagatgc 13320 
cgggtaaagt cttatctgtg aggtatgctt 13380 
gggagctggg agcgctcccc tcgggtcccg 13440 
ggtcaggcgc gtccgtctcc cctaaggggt 13500 
cttcaaaggc gcccggtggg gtctgggctc 13560 
caatcatgtt atcccgtctg ggcacatgcc 13620 
tacgccgtac ttggcgcacg taggcgtcca 13680 
agacttggtt aacgaccagc tgggaglcgc 13740 
cagctgccac tctgagtccg gcaaggagtc 13800 
gaaagtcgag gcggaccaag tatctgagga 13860 
ccgcaccggc gccctgaaga gacagggagc 13920 
gaggcagctg cgaggggtcc gtgctggcct 13980 
tccactctgc cacaaaatcg gcgagagcct 14040 
gcaaatcgaa ctcagaaagt tcgattgccc 14100 
tatgcaagat ttgaccgagg gggtaagacg 14160 
aatggcgcag tttcctcgag gccatcagaa 14220 
atcgggtttt ggcgtcccgg agggcctcac 14280 
ggtggggccg atcctcttcg ctaggggccg 14340 
ctcgcggggc gcacttgtct tctgtgctga 14400 
gccttcccac agtggctttg gggccgtcct 14460 
aagcgggcaa agggccaact ccggtcgtca 14520 
cctcttctcc ctgctctttc ccgggtcgag 14580 
gcgataggtg cggccttccc acagtggcct 14640 
gcaccgtctg acaagcgggc agagggccaa 14700 
tcggctcggg ggcclclcct ccctgctctc 14760 
gcgcgaaatg agaattatcc tcatcgcgct 14820 
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ccacaaccaa tgccgcacta actacttgcg 
cgtctggctc cggggcgacc ataactgggg 
gggcattttc agcttccttc gtccaggtaa 
agggtaacgc cttctctccc agcctcgata 
tgacgtattg cacatcccta agtttgctgg 
ctcggggttg gcctcaatgc cccgggcaga 
lacaccgaac acacacttat cggggtttaa 
agtttccgct agatctatga gtaacgtttc 
gacataagcc tcaatattac gtcctaattg 
aaaagtagga cctgcattct ttaacccgaa 
gggggtaatg aacgcagttt tttcctcatc 
agagtatgca tctagaaaac acaaaaggtc 
tatgcgaggc agggggtaag gatccttagg 
gcacatccga agcttgccgt tcgccttggg 
ggggttgacg ctgccatcat atttttcggc 
gacattccga agactcgcca caaaggctct 
gggctgattc ccgcgtccgt gttgaggtga 
gtgggcagca cttcggtcat tacgccggcg 
acgcagatct ttctgggtcg aaggagtcga 
ggctgccgct cgtcgtgccc (ccgtcttga 
ggccttggtg gcggaggacc gcccaccagc 
tttggccacg tcgtccagcc atcgttgggc 
gtgacgtaag agcgcgcccg cagcttggag 
gacgaggagg cgacgctccc catctcgccg 
cgcggatctt tcgaccctct cgagcgcctc 
cggtggagta cgagctcgac ggcgtgggtt 
agagaggtcg tgcgcctttg cttgctcggc 
cacggaagag tacgagagct cagaaaaaca 
tgacggagcg tggggctcct caccgggaga 



gggtcgccgc taagtagagt agcaagggct 14880 
gagagcttag atacgccttc aactgggtga 14940 
acggtccgga gcgttlgaga agcttaaata 15000 
tgaaccgact tagggcggcc atgcaaccgg 15060 
ggggcgcatc cgctctatag cccgtatctt 15120 
gaccaagaac ccgagaagct tgcccgcagg 15180 
ttttatgcgg gcggagcgga gactctcaaa 15240 
ctggttgcgc gtctttacaa ccaagtcalc 15300 
gctaccgaaa gaaattcgag tagtacgttg 15360 
gggcattgtc gtataacaat aggtlcclat 15420 
ctccctagcc atgcgaatct gatggtaacc 15480 
gcaccccgca gtggagtcga caatctgatc 15540 
acatgccttg ttaaggtcgg tgtagtcgat 15600 
aacgaccacc gggttcgcca gccactcggc 15660 
gatggtgggc cggaaccttg ggggccaacg 15720 
acagccgaca ccaccaaccg ggggcacgga 15780 
cactctggac gaggaagcgc cctccgttgc 15840 
ctcgatgctg gtgcgggcgt ccggcccccc 15900 
cgaaggagtg gcggccgaat ggcgaacagc 15960 
cgacgcggag ccggtggtag cagcaccaga 16020 
atctaggcgc tgccgtgccg tcatgactaa 16080 
tggagactcc gggtcaggga cgacaggcgg 16140 
cgcgccctgg ggcgtgctgc cgtcgccgta 16200 
ttcttctcca tcgcccgcga tcggtgaagt 16260 
cccccgctta ggactttggc atggagggag 16320 
cggctccccg tcgtcgccac tcacactcgg 16380 
catcaggctg aacaggaaaa gcttggcgca 16440 
cacactgagt cccctacctg gcgcgccaga 16500 
ccgcgcaggc ccccctttgc cggttcggcc 16560 
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ggggactcaa ggtgaaattc taagctctct 
gcataagaca cgggcgatgi atacaggttc 
tgtgttttgg gggatctgtg tatgaaggag 
tctgggtlcc gaatctggaa aagtccagtc 
gtcctccttt tatalcttaa ggggatacca 
cttaccctat cttttcataa atggacggag 
tgataggacg gcccataccl acctccactt 
ggctgtctgc tgacgacatg accagtgtca 
ccacgcgtca gtttagcaat ctacatgttg 
tggttaggat gaagcctggc atatatctaa 
acacgctagc tccagctggg gacgagcgcc 
aggcgtgcgt cagatcgcct gtcgccacct 
cacagaccgg ataaacgagt gcactgcact 
aaccacaata aatgtggtta ggtgagcccc 
caaaaaccca cgaggggtcg gggcgcctcg 
ccccctcggg gggactaaga ggagggcgaa 
gggtgccagg ccacgtgggc gatlgtgtct 
tcccatgtca ccgacaaggc catccgaatg 
accataatgc accaatgtgc atgaccacac 
gagggtgaag cctagcacca aacgaccgcc 
cggcagcaac acatagatga gacttcgaaa 
ggatgctgcc atcgtccatc taaaaagatg 
ggggagaggg taacccttga cagcgcccca 
ctgccggtcc ggtgaaccac cggactaggc 
agatcaccgt ccaccactca gaaccaccac 
acaccgcacc gacgcccctt cgtcggccga 
cccaggaccc ctccgttcca ccacccgccg 
cccgggactg ggtgacattg cttcggcagc 
tcaatccaac ctagaaactc cccgcaaaag 



gtatgtggaa ggtttgcgac cgtcgaaaga 16620 
gggccgctga gaagcgtaat accctactcc 16680 
ctacaaagta tgagccagcc Ictcccttgt 16740 
cagtccagtc cccccctcta aglgggcaag 16800 
catgcaccat ctccctcctt tctglggaga 16860 
atttgtatag ttgccgtccg aatgaccttc 16920 
ccgccgaaag caggtgcgac gtgggattat 16980 
gactgglcac aaattgctca ttcctgtcca 17040 
gcccttcttc acacaacalc ttgcctgtaa 17100 
ccaggactaa cgtgccatct ctaggaggta 17160 
tagaaaccct cgtcctgacg ggatggggcg 17220 
aacccgcgat ctgaccggtc tgtgactggt 17280 
tcgttacatg cggcgtgaca cgctcagcca 17340 
gctgtgctca cctaacccat acacgcggag 17400 
gccctcgggg ccgaggcggg tgcggtccga 17460 
cacatcaccc tcgggcccga cgtcccccga 17520 
gcctcaaacc tctagtcatg atactcctga 17580 
tattaaggag taaaagttac aagaaaaaac 17640 
accatacact acccccaagc acaaaccact 17700 
actaagtgtg accaaacgcc gctaggccla 17760 
acgatgccac caaggtggtc acgacatcla 17820 
tggttttcac ccagagaaac tcatcaagaa 17880 
aggaggttac gacgcccgaa ggcgtagccg 17940 
ttccgcctag gaccctatag ccttgatcgc 18000 
acagacaaaa ggtagcacgt agcttccacc 18060 
ctccatcgaa ccaccatccc tgagagctgg 18120 
gccgccttgc cagtlttggc caaaggagaa 18180 
ctgagcttcc cccgctggcg agctgctgtc 18240 
aaggggatga gctctaggaa gggcgagggt 18300 
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gccgaccggc aacgaggaag acaacccatc gactccagct ccctttgcac taccatctgg 18360 
cccigcgcca atgccggata cgctgtcgct ccggctccgg cgccacccac ctgcaccccc 18420 
tttgcctggt ctccgcgccc ctcctggctg cgtcgcgccg cccagctggc cgctaagggc 18480 
accgcgacgg ccgcccggcl accgaggcct ggccgcgcca Igggacagct cgcgctggca 18540 
ccagcgagcc acggccgtcg cgctgttgcc ggcgccagcg agcacaaccg ccagctccaa 18600 
gggccgagca tgccactgag ccgccgccgc tgccgcccgg gccggctgca cgtcaccggc 18660 
gcacacgacc gcacgccgcc acgctccgcc tccgcgcccg aggcagcccc atgccattgc 18720 
cgcgcacctc gcccgcccgc tgccgagccg ccaccgcgca ccttgctgag ccgccaccgc 18780 
cgtccctagc cgcctcgtgc cgccgccacg ccagatccag gcgcgggatg gccggatccg 18840 
gccttggggg cgccggatcc accgcctccc cacaccgcca cggcgtcacc acctccgacc 18900 
gcagtgaggg cttcgtcgtt tgccccatcc tcatcgcgtc gaggaggaag acgccaagaa 18960 
aaaagggcct cgccgctgcc ttccttgctc gctgccggct tcgccgccgg cgagctccgg 19020 
cggcggcgag gtgggggaga agaagtgggg agtgggcagc tagggttltt tcgcccccca 19080 
agccgcccgt gcgagagcga cggtgggggg gggggggact ttccaacctc ttccagtgtt 19140 
ctagttctcc acgttatgta actcaatttg tttaaccata gaaagtaaga aacctaccag 19200 
cgtgttaagc tctctttcat tccctttctt cttcctggtt ttgcttccat cacatgtcaa 19260 
gtgaagggtt cttaactacc attactccta cacatctaat ttttttctca gatctttcgc 19320 
aggtatatat tgatgctaca ttttatgatc ttaagataat ctccttcaca ttaccctctg 19380 
ctgaaacttt agcttgaacc gtcatcttca ccacaatttg agcccaattt gcacagagca 19440 
caacgagcaa tagcttgccc ttacgttcat tatttagcat gaactactac taactaccca 19500 
agaatcaata caccggttta ataacgccat tttatcacgt taatatatgt ttcattcaac 19560 
acaccggttt tggcacagtt gcaaacttgc aataaattct ttcctacttc tccatcccat 19620 
aatataacaa attggtalgt ctcgtctggt actaagttac tatattatga gatggaggga 19680 
gcacttcttt tcttccaaaa tataagaata tagtattgga ttagatatta tctagaltca 19740 
cgaattcgat taggttglct agatttatag ttgtatgtaa tgtataattc ggtaataggt 19800 
tattacctct caggatggag ggagtagttt tgactttttt tttcttataa atcgctttga 19860 
tttttatatt agtcaaattt tatcgagttt aactaagttt atagaaaaaa attagcaaca 19920 
tttaagcacc acactagttt cattaaattt agcatggaat atattttgat aatatatttg 19980 
ttctgtgtta aaaatgclgc tatatttttc tataaacgta gtcaaattta aataagttag 20040 
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actaaaaaaa atcaaaacga cttataatat gaaatggagg aagtagtaga ctataacaaa 20100 
tttaaaccgt gctttgattt tagagcatca ctaatatgtt agcaataatc tatccctaaa 20160 
atttattttt tttcctaaac tgaaaatagg aagtggaaat actcctccat ctaagagaga 20220 
gcctaaattc aataaaaaac taaaaaacta aaggtggatc cctctattaa actaccgcaa 20280 
aaaalUatg Uttttttct cttccacgcg cgcagaacag atatctcgat caagttagca 20340 
tgtaaaattt ttaaagagal accttatacg actccltccg tatttccaaa agcaaacgga 20400 
Ittaaaatct gaclcaaata aagatctata iatccaattt acatgacaca tgtttcgccg 20460 
aatttttata ttaataataa ttaatatttt taaaattaaa ttaltagcaa tttgtttgga 20520 
ggatltatca aaacaggatg gacgl tgt 1 1 ataacagcgt cUgacctag acgcgcttgc 20580 
aaactgcggc caccctttta tcacacaaat ttttgacaat ttgacacttt ccaaaaatta 20640 
attttataaa ttaaccgtga ccaaaactta tttaaaaatg atctttttgt tgagcgcaaa 20700 
atcgtatact tcagcgccaa atagcacggc gccgacctcc cccttcccct cccctctatc 20760 
ctccactgct gccgcccacc tctccgtatc agctgcgtcg cgttggtttc cgccggcgct 20820 
gctgctgctg caccagtccg ctagggcggg cgggcatggc gcgccgcgcc gcttcccgcg 20880 
tccgcgccgg cgctgttggc gcccttcgct cggagggctc gacccaaggg cgagggggcc 20940 
gcacgggggg cagtggcgcc gaggacgcac gccacgtgtt cgacgaattg ctccggcgtg 21000 
gcaggggcgc ctcgatctac ggcttgaact gcgccctcgc cgacgtcgcg cgtcacagcc 21060 
ccgcggccgc cgtgtcccgc tacaaccgca tggcccgagc cggcgccgac gaggtaactc 21120 
ccaacttgtg cacctacggc attctcatcg gttcctgctg ctgcgcgggc cgcttggacc 21180 
tcggtttcgc ggccttgggc aatgtcatta agaagggatt tagagtggat gccatcgcct 21240 
tcactcctct gctcaagggc clctgtgctg acaagaggac gagcgacgca atggacatag 21300 
tgctccgcag aatgacccag cttggctgca taccaaatgt ctlctcctac aatattcttc 21360 
tcaaggggct gtgtgatgag aacagaagcc aagaagctct cgagctgctc caaatgatgc 21420 
ctgatgatgg aggtgactgc ccacctgalg tggtgtcgta laccactgic atcaatggct 21480 
tcttcaagga gggggatctg gacaaagctt acgglacata ccatgaaatg ctggaccggg 21540 
ggattttacc aaatgttgtt acctacaacl ctattattgc tgcgttatgc aaggctcaag 21600 
ctatggacaa agccatggag gtacttacca gcatggttaa gaatggtgtc atgcctaalt 21660 
gcaggacgta taatagtatc gtgcatgggt attgctcttc agggcagccg aaagaggcla 21720 
ttggatttct caaaaagatg cacagtgatg gtgtcgaacc agatgttgtt act tataact 21780 
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cgctcatgga ttatctttgc aagaacggaa 
ctatgaccaa gaggggccta aagcctgaaa 
atgctaccaa aggagccctt gttgagatgc 
gtatccaccc taatcattat gttttcagca 
aagtagatca ggcaatgctt gtgttcagca 
cagtgaccta tggaacagtt ataggcatac 
tgcgttattt tgagcagatg atcgatgaaa 
ccctaattca tagtctctgt atctttgaca 
aaatgttgga tcgaggcatc tgtctggaca 
attgcaaaga agggagggtt atagaatctg 
gtgtgaagcc caatatcatt acgtacagta 
agatggatga agcaacgaag ttacttgcca 
gtgttacata taatactttg attaatggct 
tagttctttt tagggagatg gagagcagtg 
taattctgca aggtttattt caaaccagaa 
ggattaccga aagtggaacg cagcttgaac 
tttgcaaaaa caatctcact gacgaggcac 
atttacagct ggagactagg acttttaaca 
gaaatgatga agccaaggat ttgtttgcag 
ttaggaccta cagtttaatg gcagaaaatc 
atgatctatt tctttcaatg gaggagaatg 
ccattgttag gaaactgtta cagaggggtg 
tgattgatga gaagcacttc tccctcgaag 
tgtctggggg aaaatatcaa gaatatcata 
tagaatcttt gagctgctga agccttttgc 
ctcctacagt cgtattagag gagggatctt 
gtcacctctc cgaattattt ttactctggt 
gcctttgatg ccagaaaaaa cacaaaagtt 
gaatttgtaa ctggagtttc aaacttaatt 



gatgcacgga agctagaaag atgttcgatt 21840 
ttactaccta tggtaccctg cttcaggggt 21900 
atggtctctt ggatttgatg gtacgaaacg 21 960 
ttctaatatg tgcatacgct aaacaaggga 22020 
aaatgaggca gcaaggattg aatccggata 22080 
tttgcaagtc aggcagagta gaagatgcia 22140 
gactaagccc tggcaacatt gtttataact 22200 
aalgggacaa ggctaaagag ttaattctlg 22260 
ctattttctt taattcaata attgacaglc 22320 
aaaaactctt tgacctgatg gtacgtattg 22380 
ctctcatcga tggatattgc ttggcaggta 22440 
gcatggtctc agttggaatg aaacctgalt 22500 
actgtaaaat tagcaggatg gaagatgcgt 22560 
gtgttagtcc tgatattatt acgtataata 22620 
gaactgctgc tgcaaaagaa ctctatgtcg 22680 
ttagcacata caacataatc cttcatgggc 22740 
ttcgaatgtt tcagaaccta tgtttgacgg 22800 
ttatgattgg tgcattgctt aaagttggca 22860 
ctctctcggc taacggttta gtgccagatg 22920 
ttatagagca ggggttgcta gaagaattgg 22980 
gctgtactgc caactcccgc atgctaaalt 23040 
atataaccag ggctggcact tacctgttca 23100 
catccactgc ttccttgttt ttagatctlt 23160 
ggtttctccc tgaaaaatat aagtccttla 23220 
agctttgaaa ttctgtgttg gagttctltt 23280 
ctctttatgt gtaaatagcg aggtatgtat 23340 
tcctagacgg taaacaagca attatgttct 23400 
tgtcgttatc tctactaacg gatcataaag 23460 
tgtctaggca gtagttttgg cattagatcc 23520 
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aacallglgt aggattcatt tgtgtgtatc 
tgtgtactgt ttagglgttg aatagtttga 
aatcgttcca ttcaacaaac acatgttccg 
tttcttactc acaagtgtcc tcttttctta 
attcaatttt ggttttgaac attgatgatc 
tattcatttt gtgaagtgat gtgagtacct 
cttcattcca attttttacg catatcgatt 
aatcacatca tgacaaagtt atttatttct 
ttccatgaat attttggcat gtgattacaa 
atttcatcat tttgtttgaa gttgaacttt 
ctagcagctc gtactcttta atggtataac 
ttcttgttgt tcaggttcaa gaaaatcatt 
ttcttaaatt attaaatcct ccagtgaatc 
ccaaacttct tggaatcagt aaagttcaaa 
gcatttcaga ggaaatcctt tcaaaagttg 
agtcgctcga caaagatgac aagaaaacaa 
ctttgttgac aatggaaaat gcatgcttgt 
caaatttctt ggtcatgatt tccctttatg 
agcagttgtc aacttgtcat aagttacata 
gcttagccct tttcttcacc tgcttttgat 
attggagtag tggactgcac tagaagcacc 
ccttatcaaa tatttgattg ttacagagac 
ttttaaagag acattaaggg agatgggagg 
ttttcattca acattggaga tgagatctcg 
tcaactctaa tagattgtgc aggcttgltc 
aagccacatg aaagtaatgc ttgtccagat 
atggaaaatg cttgaggtag taactatttt 
aaattatgtt gttltatgtt gcaaaaafag 
tgttagtatc ttacagttcc tctgatgatt 



aatctatagg gtttcattaa atttcgttta 23580 
cttgtttttt aactgaacaa aagatactga 23640 
ttaatgaaat t a t tgtacgt taccttttgt 23700 
tatcctatag atlggtacaa caaattattg 23760 
ctccctgcac tattggtgca gctgctcttc 23820 
ctcaatccca tccttatgct tctgtgcatg 23880 
gttttctttt atataacagt ccataaagat 23940 
acagtatagt tatataagta ttcaccagtt 24000 
agaagattat Ugagaaaat ccatgctttt 24060 
aatttatggt gtaaatttca gttattattg 24120 
ttcacttgtg cttattctcc aatatctccc 24180 
tgttggattc agaatctggt gtccattttc 24240 
ttgttgattc caaagcacca tcgataggtt 24300 
tgcttaatgg atcaaataag gattctgact 24360 
aagagattct cttaagctgt caagtgatca 24420 
caaggccaga acigtgtcca aagtggcttg 24480 
ctgctgtttc agtagagggt aagttttaat 24540 
accattatat ttalttatat gagccaaata 24600 
gcacctattt gcaatattca tgggtggttt 24660 
tgatgacttc calctgtgtt gcagaattga 24720 
tatggccalt gtcatactag gaaggttttc 24780 
ttctgacaca gtgtccagag ttggaggaaa 24840 
tcttgatagt atttttgacg ttatggtgga 24900 
ctaacatcgc atattttaca tttcctttgt 24960 
cttttcgcca ttttagcttt aatgcgcttg 25020 
acatagccaa aggttgttat attttggggc 25080 
catcaggaca tggaaaattg gctgcaacac 25140 
ttttttaata cttttttatt ctgcatgtgg 25200 
atalccccca cgataataac acttgaaacg 25260 
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ataataacac ttgacatatc tacaccaagt gaacattatt catttggatg ttacttttcc 25320 
agctatactt gctgitcttg catgtgtaag caagtttgga gtaaattgcg cat taat 1 1 a 25380 
aatgcttggt gttcctatct gtgtactttt tattccccaa ctaataatgc aatcatatta 25440 
cgctgataaa cigaataaat aaattaacaa tatacttclg gtggcaaacc ttgtgtatca 25500 
gaatctcata aaggatacat ccacttcagc tttggaccga aatgaaggaa catctttgca 25560 
aagtgctgct ctcctcttga aatgtttgaa aatattggaa aatgccatat ttctaagcga 25620 
tgataacaag gtaatgctcc ttatatgttc tgttlcagtt tagtacccat ttccttcttc 25680 
tgtactatct tctctcctga tttgttctgt gcaaaalgtg caaacagtgc gactttgtat 25740 
gtctgcttaa caattttctt t tct tcctga aaaagcaata tgaactctta cat teat 1 1 1 25800 
gettcttgea gacccatttg cttaatatga gtagaaaatt gaacccgaaa cgctccttgc 25860 
tttcttttgt tggtgtcatt atcaatacta ttgagttatt atcaggtatt tttcttaata 25920 
atacaatgtg ttegctaaca caataaaatg ttttaaacat ccagtatgtt aaagttgcag 25980 
tctgacgcct attttgtttl gctgcagctc tttcaatact tcagaattct tctgttgttt 26040 
ccagctctac atatccgaaa tegtctaaag tctctcaaca gagttactct ggtaataaca 26100 
aacaccaatt ttgtttgatc agttgatctc gttggctttt etatgeactg tctcaatata 26160 
gtttggtcgc cattcaagtc tcactacaga tgttgaactt ggcctgacac caaatattta 26220 
taaaatgeta cctgatattt ttaatatttc atgtttcctg acccagatta tcttgttggt 26280 
tectegtata agtttaatta gtgacattct tgaagctttg ttatgeagea gatgtcatgg 26340 
ggggaacttc atttaatgat ggaaagagca agaactcgaa aaaaaaaaac ttttgtcgaa 26400 
ccagacacgt cattgttgct tatcttcaaa atcagaagtt tctcatatta ctatatcttc 26460 
tggtagtgat gctggtctgl cacagaaggc attcaattgt tctccattta tatcaagcaa 26520 
tggggcatca agtggttcat taggegagag gcacagcaat ggtagtggtt tgaagttgaa 26580 
tataaaaaag gatcgtggca atgeaaatec aattagaggc tcaactggat ggatttcaat 26640 
aagagegcac agttctgatg ggaactccag agaaatggca aaaagactcc gtctatctta 26700 
aaatgtaatc accgacagtg gtggtggtga tgaccctttt gcatttgacc gccgcgtcgg 26760 
cgtcgccacc aegtaatege ccacgtcgct gcccccgctg ccacgtcgtc gaccgcgcac 26820 
ggtaatcaca cgcatctcga ggccgccgct agctgatatc ttctcatccg gttgatltgt 26880 
gattttggcg tttttgcagt ggtgatggcg gggggegace gtggccgagg cgtggagtgc 26940 
catccgcatc agggtgtatc ggccgcgctg ctccgccctg gtccgcaggc tttggcggcg 27000 
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agctggcggc ggagggagac tgtggtgaga 
atgggggatt cgccgcaggc gctctcaggt 
tatttltttt Ictcgcaaaa tgtgltgtga 
attaatgtag tttgctggaa catltacatt 
tgtggaattg tggaggggag aaaaatcatt 
aattttaaaa ctgaaggtgt ggaaatcaaa 
aaccaccatg atatattgtt ggttataaca 
atagaatgtt tagtataaat tgaggagaac 
gtagcagcta atactagcta Ngtgcattt 
atttcttact aagalgtatg atgcaaaaca 
tgtgcggatg caaataaaat catgtacatg 
agaattgcat ttcttatgtg gttacttt tg 
atttcaaggg tcagaggggt ttgcttctac 
tttgtcgcca tggttgtggt tcatccactt 
gttcaggaaa aaaatttgaa aatacccatt 
agtatgaaga atttgcgaaa atataactaa 
aaagtatggc gtggcggcgc ggagtggacg 
ggaccgagcc agtctcgctt gccggtaacg 
tgcggaaccg cggcgccaac atttttttac 
actgtttctg gtactgtttt acacagttcc 
accgaccatg aacaatgtgt gaacagtgct 
ctgcacagtg ctggagtcgc tggccactgc 
gcgattccgc gttttggagc tgccggacca 
tattttgaat ctgcggaacc gtcgctgtcc 
tltataaaac ctciccatgc algtatataa 
gcaaattttt ttcgagagct cagcactaca 
ccatgttcta caagcttgac gtcgagggaa 
ctatctatgt tgcgaaccat cagagttttt 
ggtgtttcaa gtttataagc aagacaagta 



tcggatttcg ccgctggtgg tgtcgctacc 27060 
ttgcagcctc clccactctc ttcccttttt 27120 
tgltcgtctc gctgggctgg cctcatagcc 27180 
tggaacgttg ttggcaattg ctttacaaaa 27240 
tgaacctgca gtgacaaaat tgccatctct 27300 
cataatcatt gccagcgcat cattcttgtt 27360 
gttagctcca caccaacctt gaaggtgtca 27420 
aggcagttgt taagactttc taaagaactt 27480 
gtgtttcatg gaatttgagc agcaatggat 27540 
aaaaactatg tctatacagt ttacatgtaa 27600 
gacaaactca tgggattcat accgaattcc 27660 
ttgttgattt ggttaccaga catcgatgtg 27720 
gcggtggctg cagttgcagc aatctttttg 27780 
gtgclcctat ttgaccgata ccggaggaga 27840 
ttltgaaaaa gatttacgtt tatatacact 27900 
tccgcagatc ggttatgcgg gagcgcaaca 27960 
gccgaggcgt tcgcgcggaa tggggctgcg 28020 
cggaaccggt acgctcccgc agcgccagtg 28080 
tgcatggcac tgtgtttaat actgtttgac 28140 
cgggtcagtt ccgcacaatg gaggcgcggc 28200 
gcacagggtt aaaacagtgt ataaactgcg 28260 
ggttccgcgt tttggaaccg cgggaccgtc 28320 
tgacggttcc gcgcaggatc gtcggtcccg 28380 
cgcgtttcca tttcgcggga tgcgtatatt 28440 
acataaatta ttgaaaaaat aagtatattt 28500 
ttgcaaagat ttgggcaact ctgacaatlt 28560 
tggagaacct gccaccgaat agtagccctg 28620 
tggatatcta tacccttcta actctaggaa 28680 
tatttatgtt ccgaattatt tgatgggcaa 28740 
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(glatctctt aggaglaatt cctttgcggc gtatggacag caggagccag ctggtatggc 28800 
tgtagtctca tccctgcttt cttaaglaga catatatgca attacagaat ttggtaaaca 28860 
aacaagattt tatgaatcat atatgatttt ggggaaaaca ccaaactctc tttggtggct 28920 
gccttgaaca tagttctatt cacacagtta tagcaccttc tttaaaatga agaactttgt 28980 
tgcatacaca tatggccaaa ccacataatg aattttgttt atttctatct ttgaatgtta 29040 
gcaccttatt ttcatgcata tcatgctaat ttgcttgccc acgttgagtg ggaatttttt 29100 
tccatgtttt ataatltata tatgttclag acttctagtc cacaatttat ctacttcatg 29160 
ttcctgagcc tctagtatgg ctggtagcag actaggtgct gagtgctgtc catttttgca 29220 
gactgaagag aggagaaata caggac (gtc cgttgttagt cagalttgta aaaatagact 29280 
ctgatgtagt ttattttagc ccctatttta tatttaacaa tacaaatata taacgtatcc 29340 
taagaactta tcgtaattta ggagaagttg ctcgtttcat taaattaaac tgtgaagtaa 29400 
aaatgtgtgc tcgagtctgt caatgcaatc ctgtgttctt gtttgaagat atggtgtagg 29460 
gcaggctagg atcgaacact gaatggtaag actgcttctg ccttcatttg tgcacttggt 29520 
gctgccacgc cgattaagca gtagaacaaa gtaattttgt cgtgcacaaa tgagttatat 29580 
ttcatlgaaa atcgaagtga aaatgaacca aaagatagaa gaaaagggga aacttggtaa 29640 
ttatatactc cacaaattta ttggtaagat ttgatattag acgctcgatt acttggctta 29700 
agttaaggat atcaaatttg gggaagcacc aaaggaatta ttgtgaagga gttgtgggtg 29760 
cataacgtta t'ctactagtt caaatcctag tgactatgaa lattaatgag taaggtaagg 29820 
gatttattgt taattttagt ttctttaaga ttgtglccga gtacaccatt cggtaagtgt 29880 
aataatgttt tgtattggat icacttgtgt tacgtgcatg tgcttttacc ttttcatttg 29940 
tttctgcgit ctgggtatga atttgacgag attccatggt cagctcaaca tatcagttac 30000 
tgcgtgtcaa gcgatcttat atggtatgcg cacaagcgat tgtatacgga tatgacagta 30060 
taatgtgtga tattgatacg atgttccttt cctttataaa ggaacaaaga ctttttttaa 30120 
aaaaagaagg ggtat tacta aaaaccaaaa tgtcaaaaac aaaatatcag tgcacatggc 30180 
aagtgtgcac gagcaatagc Ugcccltac gttcattati lagcatglac tactaclaac 30240 
tacgcaaaaa tcaattcacc gattattaaa ctgttaacat cattttagca cgttaacata 30300 
tgtttcattc aacacaccgg ttttggcaca Utacaaact tgcaaagttg caatactccc 30360 
ttcgttacat agcataagag attttaggtg aatgtgacac atctatccaa attcattata 30420 
ctagaatgta tcaccgcctc cacgccggga gggagagcgc cgccggtgga gaaaggggga 30480 
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gggagtggtc gaggggaacc agtagggtgc 
ccggcgagac aggaggaaga gggggagatg 
gcgggggggg ggggggggga gcggcgacgc 
tgagagagat aggggagagg gaaaatgatt 
tatatagatc gggatcaatc aggaccgtcc 
gttgggccgg gtgccactcc taggttgccc 
gaaataagtt cactttaggt cctttaagtt 
tattgggcca ccccataggc catgtgtacg 
cttaatttgt cccctcaaac ttctaaaacc 
ttgcaactca cgggcatatt tgctagtgac 
tatagaattt aaactgtgct ttcattttag 
ttaaataaat gcagaaatga tgtttttatt 
tcaaaaaaaa attaaaagag atttattaat 
accatatcat attatttcac ggtccgtgat 
gggagagaag gggcattgtt tatttttata 
aaataaatat aatcattatc ataacattag 
ttcagtacaa taaaaaaata aaagtgggcc 
ccaagagcgc cgcacacaac tgcgcaagag 
tctcctacac catccctgcc atctccttcc 
tagggcgggc atggcgcgcc gcgccgcttc 
ctcgatccaa gggcgagggg gccgcgcggg 
ccacgtgttc gacgaattgc tccgtcgtgg 
tctcaacggg ctgtgtgatg agaacagaag 
ggctgatgat ggaggtgact gcccacctga 
cttcttcaag gagggggatc tggacaaaac 
gaggatttcg ccaaatgUg tgacctacaa 
aactgtggac aaggccatgg aggtacttac 
ttgcatgaca tataatagta ttgtgcatgg 
tattgtattt ctcaaaaaga tgcgcaglga 



cctccccgtc gccgcctccc cgtggccgcg 30540 
gagcggcgcc gccggtgagg gcgcgcgtgc 30600 
cggtgaggaa gggaagggga gtggtggclt 30660 
ttagagttag ggtttgggct gctgagtttt 30720 
atcagatcgg acaactacgg tttctcccgc 30780 
acactatlgg gccacatgta cgctccgcgt 30840 
gcctctgaat tgttcccagg ccggccgcac 30900 
ctccgcacag aataatttcg ctttagctcc 30960 
agtgcaaatc tttaattttt agttcaccca 31020 
atataatatg aaacgaagga tgtagcagac 31080 
agcatcacta actgttattt agatttttat 31140 
atgaaaatta gcaataaagc tcccaaaatt 31200 
catggttaat ttaattaaaa attaaatcta 31260 
gaggaaatgg cagctgctat cacttatggt 31320 
actatctctt ataactccca tgaaactata 31380 
tttttttcca ttgcaacgca agggtaattt 31440 
attctgaacg gaaatttctg gtttttttlc 31500 
atcgatcgcg atcaccctgc (cgtcgccga 31560 
cctccactgg ctgctgctgc acctgtcagc 31620 
ccgcgctgcl ggcgcccttc gctcggaggg 31680 
gggcagtggc gglggcgcgg aggacgcacg 31740 
cataccagat gtcttctcct acaatattct 31800 
ccaagaagct ctcgagctac tgcacataat 31860 
tgtggtgtcg tacagcaccg tcatcaatgg 31920 
ltacagtaca tacaatgaaa tgcttgacca 31980 
ctctattatt gctgcgctat gcaaggctca 32040 
caccatggtt aagagtggtg tcatgcctga 32100 
gtttlgctct tcagggcagc cgaaagaggc 32160 
tggtgtcgaa ccagatgttg ttacttataa 32220 
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ctcgctcatg gattatcttt gcaagaacgg aagatgcacg gaagcaagaa agatttttga 32280 
ttctatgacc aagaggggcc taaagcclga aattactacc tatggtaccc tgcttcaggg 32340 
gtatgctacc aaaggagccc ttgttgagat gcatgglctc ttggatttga tggtacgaaa 32400 
cggtatccac cctaatcalt atgttttcag caltctagta tgtgcatacg ctaaacaaga 32460 
gaaagtagaa gaggcaatgc ttgtgttcag caaaatgagg cagcaaggat tgaatccgaa 32520 
tgcagtgacg tatggagcag ttataggcat actttgcaag Icaggcagag tagaagatgc 32580 
tatgctttat tttgagcaga tgatcgatga aggactaagc cctggcaaca ttgtttataa 32640 
ctccctaatt catggtttgt gcacctgtaa caaatgggag agagctgaag agttaattct 32700 
tgaaalgttg gatcgaggca tctgtctgaa cactat 1 1 tc tttaattcaa (aattgacag 32760 
tcattgcaaa gaagggaggg ttatagaatc tgaaaaactc tttgacctga tggtacgtat 32820 
tggtgtgaag cccgatatca ttacgtacag tactctcatc gatggatatt gcttggcagg 32880 
taagatggat gaagcaacga agttacttgc cagcatggtc tcagttggaa tgaaacctga 32940 
ttgtgttaca tatagtactt tgattaatgg ctactgtaaa attagcagga tgaaagatgc 33000 
gttagttctt tttagggaga tggagagcag tggtgttagt cctgatatta ttacgtataa 33060 
tataattctg caaggtttat ttcaaaccag aagaactgct gctgcaaaag aactctatgt 33120 
cgggattacc aaaagtggaa ggcagcttga acttagcaca tacaacataa tccttcatgg 33180 
actttgcaaa aacaaactca ctgatgatgc acttcggatg tttcagaacc tatgttlgat 33240 
ggatttgaag cttgaggcla ggactttcaa cattatgatt gatgcattgc ttaaagttgg 33300 
cagaaalgat gaagccaagg at ttgtttgt Igclttclcg tctaacggtt lagtgccgaa 33360 
ttattggacg tacaggttga tggctgaaaa tattatagga caggggttgc tagaagaatt 33420 
ggatcaactc tttctttcaa tggaggacaa tggctgtact gttgaclctg gcatgclaaa 33480 
tttcattgtt agggaactgl tgcagagagg tgagataacc agggctggca cttacctttc 33540 
catgattgat gagaagcact tttccclcga agcatccact gcttccttgt Uatagatct 33600 
tttgtctggg ggaaaatalc aagaatatca tagatttctc cctgaaaaat acaagtcctt 33660 
tatagaatct ttgagctgcl gaagcatttt gcagctttga aattctgtgt tggaattctt 33720 
ttctcctaca gtccgattag aggagggatc Itctctgtat gtgtaaatag cgaggtatgt 33780 
atgtcacctc tccgaattat tttgactglg gttcctggac tgtaaacaag ctattatctt 33840 
ctggtgttga tgccagaaaa aacacaaaag Utgtcgtta tctctactaa cggatcataa 33900 
agggglltgt aaclggagll tcaaactt.aa ggtatctagg caglaggtat atatlgatcc 33960 
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tacatcttat gatcttaaga tgatatcctt ctcattatcc tclgctgaaa ctttagctlg 34020 
aaccgtcatc tacaccacaa tttgagcccc ttagcacaga gcacaacgag caatagcttg 34080 
cccttacgtt cattatttag catgcactac tactaactac ccaataalca atacatcggt 34140 
tattaaactg tttgtacagt ttaataatgt cattttatca cgttaacaia tgtttcattc 34200 
aacaccacac cggttttggc acagttgcaa acttgcaata acatttttac tacttctccg 34260 
ccccataata taacaatctc gttccatact atattgctat attacaggat ggalgaagta 34320 
cttcttttct tccaaaatat aagaatctag tactagatta gatattattt ggattcacga 34380 
atttgattag gctgtctaga tttgtagtcg tatgtaatgt ctaattcggt aataggttat 34440 
tacctctttg gatggaggga gtagttttta tttcgtactc cctccgtttc atattataag 34500 
ttgttttgac ttttttctta gtcaaatttt attgagtttg attaaatlta tagaaaaaaa 34560 
ttagcaacat ttaagcacca cattagtttc attaaatgta gcatggaata tatttttata 34620 
atatgtttgt tttttattaa aatgctacta tatttttcta taaatgtagt caaalttaaa 34680 
gaagtttgat tatgaaaaaa tcaaaatgac atataatatg aaactgagga tgtagcagac 34740 
tatagcaaat ttaaactatg cttttatttt agagcatcac caaaagatta gcaataattt 34800 
atccctaaaa ttcaagtttt gggtttctta aactgaaaat aggaagtgaa aaatcttttc 34860 
cgtccaagag atagcctaaa tcttatctta actaattaaa atattcataa ttttcctttc 34920 
gtcacattaa attttcgtcc gtaaatctga ttgaaatcca attggacaat ccaaaaaata 34980 
gagaaaaaga acagaaaaaa taataaaaag cacacaaatc ttatctcaat cccgcgggaa 35040 
gctgccgacg ccgccgaatc cgctcgagcg ccgccgccgc cgctcacggg gaacgatgtc 35100 
gctgclgtcg cacgcggtat gggagggcgc cgctgccact gcttgggaga taggatatgg 35160 
agagagaagg aaatgtgagg gttagggtta ggttiltccc cgtccgtatc ttcagcgaca 35220 
cggaggcgat ccaagctgtc catcagatcg gacggctcag aatgcctcca tcgtcgggcc 35280 
gcgcatgctt gatgggccga gggaaggccg gagggtcgaa caaacgcaat caaaggagga 35340 
gttggaggag glaaattaga atttatttgc gggctgagal agtaaatgga ctgaaaalgg 35400 
cccatagaga aattgggaat tttatttaaa taaatgttga aaaggtgttt atat tatcaa 35460 
aattaaaaat taagctccga aaattctaaa aaatattcaa agagcattat taatcatggt 35520 
taatttaata aaaattaaat ccaaccatat catattattt cacggcgcgc gglaggaaaa 35580 
tgcgcagctg ttgtcgttta cggtgggaga gaagggacat tgtttatttc cagaactatc 35640 
ttttataact cccatggaac tttaaaataa atataatcat tattatagca ttagtttttt 35700 
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tctgtctttt ttttccccaa gagcgccgcg 
cgacgtcgcc ggccgatctc tcattctctc 
catccctgcc atctcctcct tcccctcccc 
cgtataagac aaactgcgtt gcggcgttgg 
gctagggcag gcatggcgcg ccgcgccgct 
ggctcgatcc aagggcgagg aggccgcgcg 
gtgttcgagg aattgctccg gcgtggcagg 
ctcgccgacg tcgcgcgtca cagccccgcg 
cgagccggcg ccggcaaggl aactcccacc 
tgctgccgcg cgggccgctt ggacctcggt 
ggatttagag tggaagccat caccttcact 
aggacgagcg acgcaatgga catagtgctc 
gatgttttct cctgcaccat tcttctcaag 
gctctcgagc tgctgcacat gatggctgat 
gtgtcgtata ccactgtcat caatggcttc 
agtacatacc atgaaatgct tgatcggagg 
attattgctg cgttatgcaa gggtcaagct 
atggttaaga atggtgtcat gcctaattgc 
tgctcttcag agcagccgaa agaggctatt 
gtcgaaccag atgttgttac ttataactcg 
tccaccgaag ctagaaagat ttttgattct 
gctacctatt gtaccctgct tcaggggtat 
gctctcttgg atttgatggt acgaaacggc 
ctaatatgtg catacgctaa acaagagaaa 
atgaggcagc atggattgaa tccgaatgta 
tgcaagtcag gcagtgtaga cgatgctatg 
ctaaccccta acattattgt gtatacclcc 
tgggacaagg ctgaagagtt aattcttgaa 
attttcttta attcaataat tcacagtcat 



cagaagagat cgatcgcgat ctccctgccc 35760 
cacgccctgc tcgtcgccga tctcctacac 35820 
tctatcctcc actggtgccg cccacctclc 35880 
tttccgccgg cgctgctgct gcacctgtca 35940 
tcccgcgctg ttggcgccct tcgctcggac 36000 
gggggcagtg gcgccgagga cgcacgccac 36060 
ggcgcctcga tctacggctt gaaccgcgcc 36120 
gccgccgtgt cccgctacaa ccgcatggcc 36180 
gtgcacacct atggcattct calcggttgc 36240 
ttcgcggcct tgggcaatgt cgtcaagaag 36300 
cctctgctca agggcctctg tgccgacaag 36360 
cgcagaatga ccgagctcag ctgcatgcca 36420 
ggtctgtgtg atgagaacag aagccaagaa 36480 
gatcgaggag gaggtagcgc acctgatgtg 36540 
Itcaaagagg gggatlcaga caaagcttac 36600 
atttcaccag atgttgtgac ttacagctct 36660 
atggacaaag ccatggaggt acttaccacg 36720 
atgacatata atagtattct gcatggatat 36780 
ggatttctca aaaagatgcg cagtgatggt 36840 
ctcatggatt atctttgcaa gaacggaaga 36900 
atgaccaaga ggggcctaga gcctgatatt 36960 
gctaccaaag gagcccttgt tgagatgcat 37020 
atccaccctg atcatcatgt attcaacatt 37080 
gtagatgagg caatgcttgt attcagcaaa 37140 
gtgacgtatg gagcagttat aggcatactt 37200 
ctttattttg agcagatgat cgatgaagga 37260 
ctaattcata glctctgtat ctttgacaaa 37320 
atgttggatc gaggcatctg tctgaacact 37380 
tgcaaagaag ggagggttat agaalctgaa 37440 
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aaactctttg acctgatggt acgtattggt gtgaagccca atgtcattac gtacagtact 37500 
ctcatcgatg gatattgctt ggcaggtaag atggatgaag caacgaagtt actctccagc 37560 
atgttctcag ttggaatgaa acctgattgt gttacatata atacttlgat taatggctac 37620 
tgtagagtta gcaggatgga tgacgcatta gctcttttca aagagatggt gagcagtggt 37680 
gttagtccta atattattac gtataacata attctgcaag gtttatttca taccagaaga 37740 
actgctgctg caaaagaact ctatgtcggg attaccaaaa gtggaacgca gcttgaactt 37800 
agcacataca acataatcct tcatgggctt tgcaaaaaca atctcactga cgaggcactt 37860 
cgaatgtttc agaacctatg tttgacggat ttacagctgg agactaggac ttltaacatt 37920 
atgattggtg cattgcttaa agttggcaga aatgatgaag ccaaggat tt gtttgcagct 37980 
ctctcggcta acggtttagt gccagatgtt aggacctaca gtttaatggc agaaaatctt 38040 
atagagcagg ggt tgctaga agaattggat gatctatttc tttcaatgga ggagaatggc 38100 
tgtactgcca actcccgcat gctaaattcc attgttagga aactgttaca gaggggtgat 38160 
ataaccaggg clggcactta cctttccatg attgatgaga agcacttttc cctcgaagca 38220 
tccactgctt ccttgttata gatcttttgt ctgggggaaa atatcaagaa talcatagat 38280 
ttctccctga aaaatacaag tcctttatag aatctttgag ctgctgaagc attltgcagc 38340 
tttgaaattc tgtgttggaa ttcttltctc ctacagtccg attagaggag ggatcttctc 38400 
tgtatgtgta aatagcgagg tatgtatgtc acctctccga attattttga ctgtggttcc 38460 
tggactgtaa acaagctatt atcltctggt gttgatgcca gaaaaaacac aaaagtttgt 38520 
cgttatctct actaacggat cataaagggg tttgtaactg gagtttcaaa cltaaggtat 38580 
ctaggcagta gtttlgacal tagatccaac attgtgtagt attcatttgt gtgtatcaat 38640 
ctatagggtt tcattaaatt tcatttgtgt actgtttagg tgttgaatat atigttttac 38700 
ttgtttttta actgaacaaa agatagctga agctttgttc tttaccaaat gcagtagtga 38760 
tcatcacaat atattttttt acggaacagg agatlgtata aaatggtttc catcggcggc 38820 
caacggcgac cgctctgctc tgacccacca cccaatccat ccatccactc gccgccgccc 38880 
ctgatccaag cctccgccgc gcgacagcga cgcaccgccg tcgagaggag gaggcgtgag 38940 
ccccatgggg accctcctcc ggccgcgtaa tgccgctgca cggiaaccac gcgcctctcg 39000 
aggcctccgc cgctagctga tctcttctca tcctgtttgg gtttgggttt gtgatttggg 39060 
tgttttttcc gcagcggtgg tggtggtggt ggttgcggcg ggagggggcg gtggccgcgg 39120 
ccgtggcgtg gagtgccagc Igcatcgggt gcaccgccgc cggggtccgc aggttgtggt 39180 
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ggcgacggcg agctgaggag gcggagggag actggtgagg gacacaggca ggcaggctct 39240 
caaggctaag ct tgt tacag gtactgagac tagttactaa ttactttgat aatcagtata 39300 
aataagcttg tgtagtgtaa tggcattgtg catttctgca cttgtaaatt ttacagaaga 39360 
tggtcattca at t igaacct gcatctaata ttttagtggt ttgagtttat tclcccagtc 39420 
acagagttga agaggcaagt aacctgtaag agaggactga acattaacac ctcttgttcg 39480 
attaaaaatg accaaagagc atcaaacatg tattcgaggc tgttacttta atatggccca 39540 
ttaatttgtt tagttggcta tgtacatcct agttggtgca gtgttgtgga aaacggaata 39600 
cgggtgtcgg atggacgagg tgccgtcaag cgattaatcg taatacggat galtaaacgg 39660 
aattatatgg atttttggcg ttcgcactaa gatgtacata attgatgtta atggcaatgg 39720 
tggagacaaa atgcatcatc ttaataaaaa atatttgtat aaatctctaa ctatattatg 39780 
aaaatgccat ttattagttc aatagatatc aacactgatg gttagtagcg caatagcatt 39840 
gggcttgtta gtcaaaatag tgcagctggg ctgcaagttg caagtttatg ttagtttcat 39900 
aaacagacat clgatttgtc gataaataac cgactaatcg tgccatacaa ctgtataatt 39960 
actctgaaat agtaatgttg ctccgacttg atgatacggt acggtclggc taccgtttcc 40020 
gttttgacag acgattaaac ggctgtgccg gtcgacttcc acaacactga gttggtgtaa 40080 
atgccagtta ccatttctat gaictaaaat aatcaactct tttagtatat tttcaaaaac 40140 
gaaaattcag tacacatgca tgaatcttaa tcttcatatc tagctcgtta caaaatcaac 4O2O0 
aaaggcaccg tgtcagctgg tgcacattag ctagttcgta cttagcatta tccactagca 40260 
ccttattttc algcatatca tgctaatttg cttgcccacg ttgagtggga atltttttcc 40320 
atgttttata atttatatat gttctagact tctacttcat gttcctgagc ctctagtatg 40380 
gctggtagca gactaggtgc tgaatgctgt ccttttttgc agactgaaga gaggagaaat 40440 
acaagactgt ccgttgttag tcagatttgt aaaaatagac actgatgtag tttatttttg 40500 
cccctatttt atatttaaca atacaaatat ataacgtatc ctaagaattt atcgtaattt 40560 
aggagaagtt gctcgtttca ttaaattaaa ttgggaagta aaaatglgtg ctcgagtatg 40620 
tcaatgcaat cctgtgttct tgtttgaaga tatggtgtag ggcaggccag gattgaacac 40680 
tgaatggtaa gactgcttct gctttcagac gttattgcta aatttttagc (agttgcaat 40740 
lagtgctgtc acgccgatta agcagtagaa caaagtaatt ttgtcgtgac aaatgagtta 40800 
tatttctttg aaaalcgaag cgaaaacgaa ccaaaagata gaagaaaagg gaaacttggt 40860 
aattactcca caaagagaac aaatttattg gtaagatttg atatgagatg c tcgal tact 40920 
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tggcttaagt taacaatatc aaat t tgggg 
taaagatatc aaat t tgggg aagcaccaaa 
aacgttattt gctttgttca. aatcctagtg 
aaggaattta ttgttaattt taggttcttt 
ctgtaataat gttttgtatt ggattcactt 
taccttttca tttgtttgtg cgttctgcgt 
aacatgtcag ttactgcgtg tcaagcagtt 
cgcacaagcg attgtatacg gatatgacag 
aaaaaatacg atgttactit ccttcataaa 
aggggtatta ctaaaaacaa aaatgtcaaa 
ctcggcaatt ttttgtctgt actttaaaca 
ggtgtcacaa atattttaaa ttagccaaac 
ataatttata ctctaaaagg ttgtgtacat 
caaacattaa tccacgttat ataagtcaat 
tactagcgtg ttaagctaag ctctctttca 
tcacttgtca agtgaagggt tcttaactac 
agatctttcg taggtatata ttgatcctac 
attatcctct gctgaaactt tagctlgaac 
gcacagagca caacgagcaa tagcttgccc 
taactaccca ataatcaata catcggttat 
tttatcacgt taacatatgt ttcattcaac 
tgcaataaca tttttactac ttctccaccc 
ttgctatatt acgggacgga tgaagtactt 
tagattagat attatttgga ttcacgaatt 
gtaatgtcta at tcggtaat aggttattac 
cgtactccct ccgtttcata ttataagttg 
gagtttgact aaatttatag aaaaaaatta 
aaatgtagca tggaatatat ttttataata 
tttttctata aatgtagcca aatttaaaga 



aagcaccaaa agaattattg tgacttaagt 40980 
ggaattattg tgatggagtt gtgggtgcat 41040 
actatgaata tgaatattaa tgcgtaaggt 41100 
acgattgtgt ccggggacgc cattcggtaa 41160 
gtgttacatg cacgcactaa acatgtgctt 41220 
ttgaatttga cgagattcca tggtcagctc 41280 
actgcgtgtc aagcgatctt atatggtatg 41340 
tataacgtgt gatattgatt tttttatata 41400 
ggaacaaaga cttmtttt aaaaaaaaga 41460 
aacaaaatat cagtgcacat ggcaagtgtg 41520 
aaaatatttc tatatggtat tttttacaag 41580 
atctgcattt tattaaaaac tgtataaatt 41640 
ctctcttgga gaaaatgtat aagttgcgaa 41700 
ctgttattta accatagaaa gtaagaaacc 41760 
ttctctttct tcttcctggt tttgcttcaa 41820 
cattactcct actcaccaaa tttttttctc 41880 
atct tatgat cttaagatga tatccttctc 41940 
cgtcatctac accacaattt gagcccctta 42000 
ttacgttcat tatttagcat gcactactac 42060 
taaactgttt gtacagttta ataatgtcat 42120 
accacaccgg ttttggcaca gttgcaaact 42180 
cataatataa caatctcgtt ccatactaga 42240 
ctttccttcc aaaatataag aatatagtac 42300 
tgattaggct atctagattt gtagtcgtac 42360 
ctctttggat ggagggagta gtttttattt 42420 
ttttgacttt tttcttagtc aaattttatt 42480 
gcaacattta agcaccacat tagtttcatt 42540 
tgtttgtttt tttattaaaa tgctactata 42600 
agtttgatta cgaaaaaaaa tcaaaatgac 42660 
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atataatatg aaactgagga tgtagcagac 
agagcatcac caaaagatta gcaataattt 
aactgaaaat aggaagtgaa aaatcttttc 
actaattaaa atattcataa ttttcctttc 
ttgaaatcca attggacaat ccaaaaaata 
cacacaaatc ttalctcaat cccgcgggaa 
ccgccgccgc cgccgccgct cacggggaac 
gggcgccgcc gccgctgctt gggagatagg 
gttaggtttt tccccatccg tatcttcagc 
atcggacggc tcagaacgcc tccatcgtca 
gccggagggt cgaacaaacg cagtcagagg 
ttgcgggctg agatagtaaa tggactgaaa 
taaataaatg ttgaaaaggt gttlatatta 
taaaaaatat tcaaagagca ttattaatca 
atatcatatt atttcacggc gcacggtagg 
gagagaaggg acattgttta tttccagaac 
ataaatataa tcattattat agcattagtt 
cgcgcagaag agatcgatcg cgatctccct 
tctccacgcc ctgctcgtcg ccgatctcct 
cccctctatc ctccactggt gccgcccacc 
Uggtttccg ccggcgctgc tgctgcacct 
cgcttcccgc gctgttggcg cccttcgctc 
cgcggggggc agtggcgccg aggacgcacg 
caggggcgcc (cgatctacg gcttgaaccg 
cgcggccgcc gtgtcccgct acaaccgcat 
cgacttgtgc acctacggca ttctcatcgg 
cggtttcgcg gccttgggca atgtcattaa 
cactcctctg ctcaagggcc tctgtgccga 
gctccgcaga atgaccgagc tcggctgcat 



tatagcaaat ttaaactatg cttttatttt 42720 
atccctaaaa ttcaagtttt gggtttctta 42780 
cgtccaagag atagcctaaa tcttatctta 42840 
gtcacattaa attttcgtcc gtaaatccga 42900 
gagaaaaaga acagaaaaaa taataaaaag 42960 
gctgccgacg ccgccgaatc cgctcgagcg 43020 
gatgtcgctg ctgtcgcacg cggtatggga 43080 
atatggagag agaaggaaat gtgagggagg 43140 
gacacggagg cgatccaagc tgtccatcag 43200 
ggccgcgcat gcttgatggg ccgagggaag 43260 
aggagttgga ggaggtaaag tagaatttat 43320 
atggcccata gagaaattgg gaattttatt 43380 
Icaaaattag aaattaagct ccgaaaattt 43440 
tgattaattt aataaaaatt aaatccaacc 43500 
aaaatgcgca gctgttgtcg ctgacggtgg 43560 
tatcttttat aactcccatg gaactttaaa 43620 
tttttctgtc ttttttttcc ccaagagcgc 43680 
gccccgacgt cgccggccga tctctcattc 43740 
acaccatccc tgccatctcc tccttcccct 43800 
tctccgtata agacaaactg cgttgcggcg 43860 
gtcagctagg gcgggcatgg cgcgccgcgc 43920 
ggacggctcg atccaagggc gaggaggccg 43980 
ccacgtgttc gacgaattgc tccgccgtgg 44040 
cgccctcgcc gacgtcgcgc gtgacagccc 44100 
ggcccgagcc ggcgccgacg aggtaactcc 44160 
ttgctgctgc cgcgcgggcc gcttggacct 44220 
gaagggattt agagtggacg ccatcgcctt 44280 
caagaggacg agcgacgcaa tggacatagt 44340 
accaaatgtc ttctcctaca atattcttct 44400 
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caaggggctg tgtgatgaga acagaagcca agaagctctc gagctgctgc acatgatggc 44460 
tgatgatcga ggaggaggta gcccacctga tgtggtgtcg tataccactg tcatcaatgg 44520 
cttcttcaaa gagggggatt cagacaaagc ttacagtaca taccatgaaa tgctggaccg 44580 
ggggatttta cctgatgttg tgacctacaa ctctattatt gctgcgttat gcaaggctca 44640 
agctatggac aaagccatgg aggtacttaa caccatggtt aagaatggtg tcatgcctga 44700 
ttgcatgaca tataatagta ttctgcatgg atattgclct tcagggcagc cgaaagaggc 44760 
tattggattt ctcaaaaaga tgcgcagtga tggtgtcgaa ccagatgttg ttacttatag 44820 
cttgctcatg gattatcttt gcaagaacgg aagatgcatg gaagctagaa agattttcga 44880 
(tctatgacc aagaggggcc taaagcctga aattactacc tatggtaccc (gcticaggg 44940 
gtatgctacc aaaggagccc ttgttgagat gcatggtctc ttggatttga tggtacgaaa 45000 
cggtatccac cctgatcatt atgttttcag cattctaata tgtgcatacg ctaaacaagg 45060 
gaaagtagat caggcaatgc ttgtgttcag caaaatgagg cagcaaggat tgaatccgaa 45120 
tgcagtgacg tatggagcag ttataggca! actttgcaag tcaggcagag tagaagatgc 45180 
tatgctttat tttgagcaga tgatcgatga aggactaagc cctggcaaca tlgtttataa 45240 
ctccctaatt catggtttgt gcacctgtaa caaatgggag agggctgaag agttaattct 45300 
tgaaatgttg gatcgaggca tctgtclgaa cactattttc tttaattcaa taattgacag 45360 
tcattgcaaa gaagggaggg ttatagaatc tgaaaaactc tttgagctga tggtacgtat 45420 
tggtgtgaag cccaatgtca ttacctacaa tactcttatc aatggatatt gcttggcagg 45480 
taagatggat gaagcaatga agttactttc tggcatggtc tcagttgggt tgaaacctaa 45540 
tactgttact tatagcactt tgattaatgg ctactgcaaa attagtagga tggaagacgc 45600 
gttagttctt tttaaggaga tggagagcag tggtgttagt cctgatatta ttacgtataa 45660 
cataattctg caaggtttat ttcaaaccag aagaactgct gctgcaaaag aactctatgt 45720 
taggattacc gaaagtggaa cgcagattga acttagcaca tacaacataa tccttcatgg 45780 
actttgcaaa aacaaactca ctgatgatgc acttcagatg tttcagaacc tatgtttgat 45840 
ggatttgaag cttgaggcta ggactttcaa cattatgatl galgcattgc ttaaagttgg 45900 
cagaaatgat gaagccaagg atttgtttgl tgctttctcg tctaacggtt tagtgccgaa 45960 
ttattggacg tacaggttga tggctgaaaa tattatagga caggggttgc tagaagaatt 46020 
ggatcaactc tttctltcaa tggaggacaa tggctgtact gttgactctg gcatgctaaa 46080 
tttcattgtt agggaactgt tgcagagagg tgagataacc agggctggca cttacctttc 46140 
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catgattgat gagaagcact tttccctcga agcatccact gcttccttgt ttatagatct 46200 
tttgtctggg ggaaaatatc aagaatatta taggtttctc cctgaaaaat acaagtcctt 46260 
tatagaatct ttgagctgct gaagcatttt gcagctttga aattctgtgt tggaattctt 46320 
ttctcctaca gtcctattag aggagggatc ttctctgtat gtgtaaatag cgaggtatgt 46380 
atgccacctc tccgaattat ttttactgtg gttcctagac tgtaaacaag caattatgtt 46440 
atgctgttga tgccagaaaa aacataaaag tttgtcgtta tctctactaa cggatcataa 46500 
agggatttgt gactggagit tcaaacttaa Igtgtctagg cagtaatttt gacattagat 46560 
ccaaaacaat ttatagggtl tcattaaatt tcatctatgt gtactgttta ggtgtlgaat 46620 
agtltgactl gttttttaac tgaacaaaag atatgtctga agctttgttc tttaccaaat 46680 
gcagtactga tcatcacaat atatttttta tggaacaaga ttggattgta tagaatggtt 46740 
tctgatctga ttatcttatc tcaacgtatt attatgcaca tgtactaatc atgaaatatc 46800 
tgatggaatg atgtttctat ttacctgtgt gaggcagcaa ggagtgagat ggataacacc 46860 
acatactccc tctgtcccag aatataagaa gttttagagt tggacacgat tattaagaaa 46920 
gtaggtagaa gtgagtagtg gagggttgtg attgcatgag tagtggaggt aggtgggaaa 46980 
agtgaatggt ggagggttgt gattggttgg gaagagaatg ttggtagaga agttgttata 47040 
ttttggggag tacattatta ttctagaaca atactgttgt gctcaagaag cgttccaaag 47100 
atgtttcaca acctgtgctc gatgggtttt gagcttaatc ctgggacatt cagtatcatg 47160 
alctgtctca ttcttaaaca tggaataaag gatgacagca tgatttcttt gtctctataa 47220 
tcttttggct acccacagat aatagctgta aatctatact actttaaaag gagtagtggt 47280 
ggtggtgagt ggtgaatctg ccaccacccc accaccaact ctcaaaattc tgacatgtgg 47340 
gatcactgtc aatcccttcl ccaagacatg tgggatcact gtcaatccct tctccaaacc 47400 
aattgtatga tagaacagtg gaaatcacgg acagaccatg gagctctcaa ccataatcat 47460 
ccttgcgagt taataacaaa tggagcgtaa acttggcaag caaaaaactc aaattaattc 47520 
taaaattaag ctctaggatt caaaatagat Itcctctctg cattgtgctg ttatgatttt 47580 
(aattccgta acaacgcaaa tgcattttgc (agtcttala aagaagggtt aatgcaaata 47640 
ttctgattaa atgattgtal ctatgaagtt tgaatgctag tggaagctcc tttgaccatg 47700 
ttttgttgtg cgagcattta agagagtgaa gagaatgctt ctttggtgct gttctggtat 47760 
ggaaggatcc acagataaaa ttcaggttct actgcttctc tgcttglaat tttcatgaag 47820 
ctgcagtgaa taccttgttg accacttgat ctgttgcttt gaaggagaat atagtagtgg 47880 
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ccaaggttgg tgacggtgat ggtggcatgt gatcccccag atcttcagtg acccagagag 47940 
gaggggacgg cgcgtggtga gctacaaggc atactcagtg gagggcaaga tcaaggcctc 48000 
ccgtccgtag gggactccgc tgcatcaagg ccaactgctc cgaacigatc aatttctggt 48060 
acggatcact tctcctttcc tttttttttt caccttaagc actctcttga ttcttcgctg 48120 
ctacctccct taatttcttt caatatattg tggcacttga tcatggcgga gacccacctt 48180 
ccagtgtgaa tggattttgt caaagaacta aatttattcc attagcttat tttccgatta 48240 
catggaagac attcttttct ggaataaata cagaactaaa tcctgtttcc tgaataaaag 48300 
ttgttagtgt gtggcatggt gcatttccgc gcttctaaat tttataaaac ctgttcattc 48360 
aattlgaacc tgcatccaat ccaatatttt aggtgcagac aggtgcttgc ggtcaggtta 48420 
aagaagttgg caaaaatgct tctgaagaaa ggttaattgt tgtttcatct caggaggtaa 48480 
tatgcagatg attattccaa ttggcattgc ctlgccattt ttatcacgag tctttacaat 48540 
tttatatcct cctacataU ctttccagat tccagatgat ccagtgtctc caacaattga 48600 
ggcgcttatt ttgctccata gtaaagtaag tacacttgct gagaaccacc agttgacaac 48660 
acggcttgtt gtaccatcaa acaaagttgg ttgtattctt ggggaaggtg gaaaggtaat 48720 
tactgaaatg agaagacgga ctggggctga aatccgagtc tactcaaaag cagataaacc 48780 
taagtacctg tcttttgatg aggagcttgt gcaggtaatt tatttggcca tacctacacc 48840 
agagatccat atattacttt tataactgca gtttttactt gttaacattt cattgtgctt 48900 
ttacatttgt tccaagcttt caggttgctg ggcttccagc tattgaaaga ggagccctga 48960 
cagagattgc ttcgaggctt tgaactagga cactcagaga tggaagttct tccaataatc 49020 
cgacaccttt tgcccctgtt gatggtcctc ctgttgatat cttgcctaac aaggaattca 49080 
tgctatatgg acgatctgct aatagtcccc calatggagg gcctgctaat gatccaccat 49140 
atggaagacc tgccattgat ccaccatatg gaagaccaat atccacaata tggaagacct 49200 
gccaatgatc caccataUg aagacctgtc aatgatacat catattgagg gttgaacaat 49260 
gafgggcctc gtgatcaggc ccggtcctga ggggggtcga atggggcgat cgctccgggc 49320 
cccccgattc ccagggcccc cacctatctg tgcaacgagt agtagcgatc ttccagcgcg 49380 
caacgtgagg cgatgtttct ccgtgatttc gccggcctgc aactgcgaga tcgcgagtat 49440 
aacgatcagc cgatcgatct catctgccga ctgccatgct gatgccacac gcaagcgcag 49500 
catatcagcc tlatcttggt tgatcggcat gclggacgag cacatctgtl gtcgcatcaa 49560 
clgctgactg ctatatatgt gctggtgctg aalcgatcga ttgtcgtcac ggaagtgaag 49620 



EP 1 437 409 A1 



aacaaccacg gcactgctgc ctgctgggct 
cctatctaga tctagatcga gattacatag 
ggtatcaatt gataatttaa ggttataacc 
attggttaga tltgtttcaa ctaatttggt 
gtattaatat tttcttaagt attagcctgt 
caatagataa caaaaacaag taggtgtgtt 
aataattctg atatlatcag gacatcatca 
aattttaata taatgcgtat gattttttaa 
ttatattata tggtaattat tatttgaagg 
acgctgggcc ttcagaatct caggaccggc 
gtagagtact tctctaaaag agagtaccct 
tcatacgaaa gatatgcagc aactactcgc 
agtcctggtg ccgattatat gtcctgccgt 
tactctaata gggttacact acaattaggc 
caattaggaa tcaccagagc tggaaattcc 
caatgcgtta gcttgcctct tctttgcaaa 
acatgaaacc atatatttga ctgttgcatt 
gtctgaaaaa aggacaaaaa tattctagaa 
gtcttgtgcc cactctaata tctatcgttg 
gcaaacccta gattcagtta ttcagctgtt 
gttgggtgtt gatcttttct cattttttat 
tctgatgcat ctgaatgtat atttttgttc 
tcttttcagg ctgcggagct gatccatgga 
actgggtaig gcttacgcag actgaatttt 
taatcattga gtgatgatct ctttgcaggt 
caaatagtta actggagtct gtcattggtg 
tggtatagcg taagagaaac atcatgcact 
accaatagtt ttcctcgtga aaatctcccc 
gttgcacgaa gcccatcctg gttcttccga 



ctagccgcca tcagtaagta cgctatactg 49680 
tggaattatc tgttlataac aaaattacaa 49740 
gtacaaactt cagtgatttg ctggtttcac 49800 
acttctgtag ccttgtaatt tacgaatcta 49860 
tccttgatat tatgctgttg agaaagtatg 49920 
gaggatgctc aagagtaata caggcacttc 49980 
ataaitctgc gcctacaaat cttcaaagaa 50040 
atacgaatat (gattgctat ttaaagatat 50100 
tttataataa aggcctccgt ttttagtttc 50160 
cctgctcatg atccttacac cgtgtatcct 50220 
agtggaagta gcaaagttgc accatctgct 50280 
Itgcctaata gagaactgcc ctcatctatt 50340 
tcltatcttg accaagtacc tactgatagg 50400 
ctcttgagag ccgggaatag laatgtgcaa 50460 
aatgcttatg attatactga ggtacalttc 50520 
tggccctcgc ctgatatgtt tccattagaa 50580 
atgtctattt tcttccatga tggttcagac 50640 
tatgtcatgg tgatccaaat atatccttct 50700 
gtaacactat tcaattgtta ccatgttgtt 50760 
ctctgctgct gttgcttacc agtlttctta 50820 
ttccttgttt cctggttcac ctgctgcctc 5088O 
tcttcagtgc ttaatagatt taaatttcat 50940 
cgtgaggatt accgaagact gtcaggtctc 51000 
lacaggacac aaacatgaat tttgtcctca 51060 
atccaggtgg ctctgtcgaa ttgtggattc 51120 
Itggtggtgt caatctagct gagatccgtc 51180 
atccccagtc ataaccatgc cccaatggcc 51240 
Itgatcccag atctctggtg cgagaglgaa 51300 
gtccattgtg gagatccagg gcattccgga 51360 
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tcaagtgaaa gccgcacaga gccttctgca aggcttcatc ggcgcaagca gcaacagcag 51420 

gcaggcgccc cagtcctctc gcatggccca ttatttttag taagctggag gacattcgca 51480 

acaggggggt cagtggtcac tgcaaagctg agtttgttct tcagttcaac tgcagaaaat 51540 

tgcagatcgg ttgccgtagt tgctagaacg gtacatagtt gccacctaac tgtagcgagt 51600 

ggcataactt attgtgtgtt actgcccaat gttgtctctc cttgtgttca tggattcaga 51660 

cttgtgattg tagtatttct ggatcagact ggagtaaaag aaaaaaaaaa aggaagacat 51720 

gggtttaaca gtaagctcaa aacgttgaca gtagtaaaat aaaaggggtt tgttcacttt 51780 

atttccaata tcaaccttac caacatttgg cgltgaatca tttataccac atcgcttgtg 51840 

cagctgaatt tggggctgtt taaaagatgg Ictcttggat tgctaattgc ctcgcggcaa 51900 

gcgtggtacc ttgtacaata taaatataat tataactatt taatttcata attaaacatg 51960 

ttgttacaaa tctctactat tataaaaatt gaagatgttt tltgccggta ttltggtacg 52020 

tcatclgtgt atgaatccgt ttttaagttc gtttgctttt ggaaalacat atctgtattt 52080 

gattcagttt ataagatcgt tcacttttgg taatacagaa ggaatcatat aagaattctg 52140 

tttaaaaaca ctcgtatagt aacttgagac gatcagacgc ctaactacag ctcatgattt 52200 

tctaaatata tatatatata tatatatata lactagaaaa aatatatgtg tgttaaaagc 52260 

tatcttaatc Italtattgt tatatatttt agltaacaag aaatctattg tgggaacttg 52320 

tttggatata tattttttta aaaaaaatca tgagctgcaa ttaggaatcc aatcgtctca 52380 

agttagcagg agggcgagU tttttaaaga galttcttat acgatttctt ctatatttct 52440 

aaaagcaaac gaacttaaaa accgactcaa acatggatct gtatttccaa aaacgaataa 52500 

acttaaaaac cgactcatgc acagatgatt aatttttata atagtagaga. taaacgaact 52560 

cccacagtga attttatttt aactgaacca tataacaata ataagattaa aatagacttc 52620 

acccgttgca atgcacgggc attttttcta gttaaagaag aaataaaaaa acacaaaaat 52680 

ttataaaatg taaaaaagaa aaatattata attttgttag aattattatt ataatataga 52740 

aaaaiagttg ccaaaatttc tcaacgaatg tcgaataaac tcagcaatgt catafattta 52800 

aatatgatgg taatatttgt tcgcaaaact ttaatcttca atccttcaac aacatagata 52860 

tacaacgtcg taatcgccaa caagcccgag tgaccataca ggatagccga gcggtggatc 52920 

tgtactgttc ttgggtgaaa taaatctagt acattgtata tcttatctta atatctacta 52980 

ttataaaaat tgaagatatt tcttcaaaga tttccatacg ttctctactc cgtlacaata 53040 

tcggttctac tccgtlacaa tatcggtttt gtacaccccg cgcacgcgtt gtgtgttctc 53100 
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ccgt tccaat acatgaagct agagtcttgc Itctccctgg tctggcaggc cctttttcca 53160 
ccatccccac cagggccagc gggttacatt gaccgatcac ggcccacatt agtggatgca 53220 
gccagccacg ctcttcacaa atcatgtgat gaacattagc tgagttaaaa tttatccitt 53280 
gatgattgtt agaaatgttt ttttctccac atcttctctt tcaattttgg aaaaatagat 53340 
ttcttgattt ttgtgctcgt acatcactaa taaatcagtt gttacccttc cacacattgt 53400 
caatttacca tgtctatttc agctcttacc ttgtatagtc ttgactcttg agtcctcgct 53460 
attgactaag ttgctacatg cctcctacaa atcaatagac tgccataaca atattttcta 53520 
cgacatgatc catattagtc catgcaatgc aagtacacac acactactgc acgaaaaaac 53580 
tatgcaccat aacttcaaaa claacatgtl agaatgacgt taatttttca ttacaattat 53640 
attcatcgac cgttaattta ctaggcatcc tgtttaaaaa aaatattcac cgaccatacc 53700 
cacatgttcc glagttcatt aggtgatgga tcggtagtta cagcagctgg atltttatat 53760 
tttggtcatt llgaaaaatt tatttcgcaa atagactcct gaaaaaactt atcccagaaa 53820 
tagtcccttt tggagcgtca gagtggctgg cgccgtggtc caacgggaca gcgccaacct 53880 
ctctggcgcc gccccccgcc tctattcttg tttctctata tagagttgca aactttttat 53940 
ttttgtttta Itlttttgga tgttttttca ctcttagaat cacgatacaa ccaactacaa 54000 
aaaaaatlaa actcgaacgg aatatatcac ttagctagaa gtctgaaaat atagcatacc 54060 
acltatctac tttgcacctt caccaaaatt agaccataac ttctttagta aaatcctttg 54120 
atcagcatat taaacataat gcactctatc actaggtgaa atlacttaat ctaattcaaa 54180 
atataactac atgtagcctt gaaaaattct acatgccaca tatttcgtcc gtttgagttt 54240 
attattttta tggttcgttc atglgagttc ccaagtgtga aaaaaaaata aaataaaaat 54300 
aaaaaagllg cacatcctct cctctgcatt agagaggaga ggagaggaaa aattctacag 54360 
gtcacatatt tcgtccattt gagttcattt tttctatggt tggttcttgt gtgttcctaa 54420 
gcgtgaaaaa aatatcaaaa aaalaataat aaalaaaaaa attcgggggg ggggggcgcc 54480 
agccactctt aggggtgaaa acgatcggal aalatccgat ccaatclgct ccgaatccat 54540 
ccgaaataag gatatggtat gggtttttag aaatctggcg gatatggatg cggatgagga 54600 
tatggtalct ccgaaatacg acggattatc cgacattttt gtcggattat ccgataggcc 54660 
ctttaccgga taatccgaaa tlatgaacac atgtaaccac tctatctatt gcatataaca 54720 
taagttggtc catccaatga cctaattcat caattaccct agatttctta ctatgtggtt 54780 
ttcaccattt catgtcacac ttgcgtagct gtatttttat aaaatggaca tcatgtattt 54840 
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atgttgttta gcacttaagc acataatlat tacaatgggt cgtttattga cattgtgtta 54900 
tttttacttg cattgctaac Icaatgttgt attgattgca tacacacgta acatctgata 54960 
aaatttaatc cgtttctgaa ccgattccgc accatttccg acatctgcat ccgtacacta 55020 
Iccacaccca ctccgaatcc gcttaaaaal atggtttagg atatggtatg accactatcc 55080 
gtccgaatcc gctttatttt cacccctagc cactctggcg cgcttcccct gccacctcag 55140 
catcgtccca ccacgtcggc agaaggacgg cggctccagc cactctggcg ccacaaaaaa 55200 
ggaccatttc tagcataagt ttttttaggg gtctatttac gaaataagtt tttaaaagga 55260 
ccaaaatgtg aaaaatccag gttacagcag actgtgataa gcaatagcta tattgcctat 55320 
atatacacgt atatgcattg ctaatccttc aattttgtcc aattctttta aattgtcttc 55380 
acctgttgca acgcatgatt ttttttctag tcttaacctt aactaatclt aataactaac 55440 
taaaagattc gtatctttcc gatcgtcacc ttgtccatac gctaattitt cgtccgtccc 55500 
ccctccccct caaaaaaaaa gggaaaaatc cattttacac cctcgaactc ttatgcttgt 55560 
ctaaaataca cccccgaact ataaaaccgg gtataataca ccctcgagct atcaataccg 55620 
gacagttcaa gggtgtatla (acctggttl tgtagtttgg gggtgtattt tagataagca 55680 
taagagttca agggcgtaaa tggacttttc cccaaaaaaa atcccagtcg ttactttcca 55740 
tcctgagaat cggagacagg gaaaactgaa gcatacacgc aaatagaatc aaagataggg 55800 
aaaactaagc atatacacac aaatatatcc aaaaattccc atgcagctag atcgggtgcc 55860 
accgttgttg ccaaaccacc acattgcaat gtaaatctaa gactaaagcc taaatcctat 55920 
gctaagtcat caaattagac tcggttctac caatttggta atatatcaaa ttagacttga 55980 
tttttactga tttgaggttc tcgaggtgtc acactatgaa acggaagttt ttcccgttgc 56040 
aacgcacggg cactatgcaa tatcttaact aattaaaaga ttcatatttt tcctttcgtc 56100 
acaccgatct ttcgtccgtc tgtaacatca cgtgcacctc ctctccaaat cccacatcat 56160 
cataatccga cccaaaaaca aaatctcaat ctcaatccaa tcagaatcat cacaaaatca 56220 
tccaaaatat caagagatga ttataggaga tggaggggtg agcaggagca acatcatcat 56280 
cgcataaaaa ccccaaaatc aatcacaaca acgacatcat tatcacataa gaaaaacaat 56340 
acaaacaaca tacacaatca acaacactgg cggatccagc cgaggggaca acggcgtggc 56400 
agcgggcaga tcctctcggt cagatccgcc cacgggtgcc actgacgtcg ccgccgccac 56460 
cggatccaag ggagaagctt cggacagagg gagagggggg tagaggaccg ctaaatccgc 56520 
ccaccggaaa tgccgccgcc accacctccg tcggatttgc ccgagggagc gccgatgccg 56580 
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ccaccgccat cgcgggagaa gcttgggcac 
cgccggalcc atccgctgga aaagcctccg 
gccgcctccg ccggattcgg tagcgggagc 
tcggtgggag ccactgacac catcgccgcc 
agaggggcgg gggcgagggc gggggacgag 
agagggacga gtgagaggag ggggacgagt 
gcggtcgcac cccggctctt tctctcgctc 
tttttttltg agtaaaatgc acgggcggtc 
tcccaaactc tcaaaatgca tatccaggtc 
cccaaaccga cacatcctct cttggatcct 
cgtgacacgt cttttttttt cttclttlct 
ttttttccat cttctgctcg ggtcacatag 
aaaaaaagaa aaaagaaaat ttttaaatgg 
tcatgtccct gcgacatgcc acatcagcac 
ttgggaccta aatgacacac tatgacaagt 
laaggattta tatgacacac tactataagt 
tttttacacg gagagaatgc gaatttgttg 
agaaatttgc ggtgggagaa ttttttttcg 
atagggatta ttactggtgt ggtggcccct 
taaattcact tttctctctt caaggagcgt 
tagaaataaa aaagaaacat attctgtttt 
ttttaagtaa tatttaaaaa tatatagtgc 
ctgttactat cacttttttt tccattgggc 
gttctctcct tttgagtttg ggcatatacc 
tggattttat tcgattcaac ttttttgggt 
tctgaaatta ggaaattagg tttggtaaag 
acatatattt atctatttat ttatttaaaa 
ttttgtcggt gacatgggac cgggagtatc 
ccacgtggcc tgatgtaaca tcctgaaaat 



ggagggtgag gaggaggggg ggtagagaat 56640 
ccggatccgc ctgccggaaa caccggtgtc 56700 
cgccgatgcc accaccgccg ccggatccgg 56760 
gcctcctctg ctaccgacaa gggagagacg 56820 
agggttagag ggagggaccg agtgggagag 56880 
gaataaggat gcgtgacctt atccactcgc 56940 
agctgttgcg cttgtggaga ggatgcgaga 57000 
cttaaacttg tagcggtctg tcatctaggt 57060 
ctagaatttg tcaaagtgta tcatctagat 57120 
acatggcgct aatgtgactt gtcacatgga 57180 
Itttcttttc cgttttcttc tcattcttct 57240 
aaaggaaaag aaaggaaaat acaagagaag 57300 
gtctcattcg tcagtcaaaa ttatgccaca 57360 
cacgtagcat cctgaagggg ttgtggcgat 57420 
tctaggactt ggatatgtat tttgagagtt 57480 
ttaaggaccg cccalgccct ttactttttt 57540 
gttagttgcg gctgagggtt tctcgcacgg 57600 
aggttctttc tattgggaga agacgggatt 57660 
gttttctttc tttttcgagc ttctttccgt 57720 
aggacatgac tgaatgcagc tgctgtaaat 57780 
tcatttttlt caataggtaa atataaagat 57840 
tgatcaacga cattgttaag tgagattttg 57900 
tcacgtacgg cattaaaagt tttagttttg 57960 
aatattgaga taggtatact aaagttcatt 58020 
tttgttcagt Ictmttac atgtttctca 58080 
tctlgaatag ataacgctgt tgacgtttga 58140 
atalatgaat aatttttatt ttgttatgac 58200 
atgactagag gcttgggcag gagcgatcac 58260 
tcccaacaat aaaaatcact aaaattttga 58320 
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actttttaaa acttttgcat catgctggtt gttalgattg ctattgcttg ccaaaccgta 58380 
aatgatcaca aagaaagtaa agtaaggatc taaaatttaa gtaatagata aatttacgag 58440 
aatataatat ttaattgcta accctacaaa taattacgca caagaaaaca aagccagaca 58500 
aacggaaggt taattactaa (ttaaattat ggattaatta ttaaatactt gaaccatgtg 58560 
ttgcgtgcca tggcatctaa atacacatga aataatggtc atataattaa attaagcttt 58620 
ataaaattat gtgaggtttt aattaagcaa ltagcitaat gttgtaccga gtcttaatat 58680 
actatttata gaataaataa attcaaccta tccgtgtaaa atatattgct ataagttcat 58740 
tcaatgtact attgtaataa taatggccac attaggatat tttaattaat tttggaaccc 58800 
tcaaagcctc caaaattatc taggttaatt ttgaaattat acctcattta agtaatgcaa 58860 
tagaaaaata tacataaaaa taaaatatgg gtaatattag aaattgagta aattttcatc 58920 
taaattaaaa catatattgg gtaaacctcc tttatgtaaa aattaagatt tatagaatga 58980 
aatttgtaca agggataaac taaaatcggg ttaaatagaa aatggcactg ttcattgcac 59040 
tctaggtgct cgacgtggtc cctggcccta ttttccccct cagccgcgcg cgcctggctg 59100 
cctcgcgccc cgcgccacgc cacccgcgtc gcgtcgccgc tgccgcgccg tcgccgtcgg 59160 
ccgttccgcg ccgctcgtcc glcgctccgc cgcctcgcgc cccgcgccgc gtcgtcatcg 59220 
cgtcgccgtc gccatcaccg cgcctggccg cccctgaccc cgcgccgcgc cgcgccgtcc 59280 
cgtagccgcg tgcgcgttcc atcgccgctg ccgcgccgcg cgccgtcacc gcgcgccgct 59340 
cgtccgccgc gcatagcccc gcgccgccgc gccatcgtgt cgccgcgccg tcgcgtcgct 59400 
ctcgagcccc gcatccctct cgagccccgc acgtcgcgtc ttgtcgccgt tgctgccgcg 59460 
tcgtcgtcgc cgatgctgtc gcgtcgccgc tgccgcccgt cgcgtcgcct tgcgccccgt 59520 
gccgccgctg ccgcgttgtc gctgtcacct tcgcgtcccg cctcgtgccg cgcgccaccg 59580 
ctgccgcccc gtcatcgccc gctcgtcgcg cgcgccgccg ccgctgccgc gccgtcaccg 59640 
icgtgtcgcc gtcggcctcg cgccttgagc cgccgcgcgc ccgtcccctc gcgcctgcgc 59700 
cccgccgcac ggccgtcccc tcgccgtcgc cclgcgccac tgccgcgccg cccgtcccat 59760 
cgcgccgagc cccgtgccgc cgcgcgcgtc gcgtcgcccc gcclgtcacg ccgctcgccg 59820 
cctcgagcca cacgcgtcgc gccgtcgcgt cgccattagg gccggccacc cctttccccg 59880 
cgccctataa aaccccccgg ccacccccct ttcaccccac accatcccca cccattcccc 59940 
tcttcctctc ctccttcccc tcttcgtccc ctccaccgcg ccgcgccgcc gccttcgtgc 60000 
cgccgcgccg tgcgccgtcg tcgcgccgcc ctcgcgccgc cgcaccgccg ccttcgtgcc 60060 
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gccgcgccgt gcgccgacgt cgtgccgccg tcgccgtcgc cgtcgtcgtg ccgccgtcgc 60120 
cgtcgccgtc gtcggtaagc cgccgtccct tccclcgttc cgacgccgtc gccgcccggg 60180 
tgggaaggag ccgagagaga gagggaggaa ggagccggga gtaggaagaa agaaaagaaa 60240 
agagagagag agaaaagaaa agagaagaaa agagaaaaga gagaaaagaa aagaaaagag 60300 
attagagaag ggagggaaga gtgggcccca cctgtcatta gccccatcca attcccctta 60360 
gaaaaataat tctgtagaaa agaaaatcaa galcttgacc ccacctgtca gtcactatag 60420 
cgtgtggata aggttgtatt aaaaataaat gaattaggaa cagtactatt tcgcaactat 60480 
tagaattaat tcaaatttga atctttacac tagcataact aattcatttt agctccgatt 60540 
tgagtggaac ttgaacctaa attcatctaa attcataagc tttccaatgg tatataattt 60600 
actattaaat aaaatatatt tataattatt aagtaattaa tatcatatga ttaggttatg 60660 
gtcaacttaa aaatatgcta ataaataaaa ttagtattgt ggatgtaata atatttgtct 60720 
ctaacatgtc ttgccactgt aacaaccaca caaactaata ttaagtgatg tctgaaatga 60780 
atgaatgaat aggaaaatac tagtacttgt ttaatattcg atagccatat aattaaaccc 60840 
atggcttata ggttatttaa atcaaatgta gccttgtgat tatgcaacta aaatataaac 60900 
acatatagat gaatctttag cttgattagg aggaataata acagagctag tgtgactagt 60960 
tatgatatag cttgttgtcg gttgcctata tttagtaaat ggttcaatgt taatacactg 61020 
atgcacacac ataccctttt tgataaccta ctagttgcat atattaaact tggtaataaa 61080 
tgaagaacca atatattagc taaatactgg tgctagttat aaatcUgac cacacataat 61140 
tttagttcaa accacacctg aggattgttc gttataaagt tataaagtta taaagttata 61200 
caaaagataa tatgtaacta taatagtatt aaaccacaaa tctaaaatac agggcgcata 61260 
attgtcaacc tttlatgcaa acggataata tccatatata tacatcatgt ggataattcg 61320 
aataatagct ccattggtaa aataataatg taggcgaatc atggtgatga gatggtttat 61380 
cctaaacctc cccatcgaca tagccatgct atagggacct gaccatttta ccttcataac 61440 
agatctcttc cataagccaa tagcUgact aaaccacaga ttagcaaatg tgtacatcat 61500 
atattgtgct agttagtacc aatagaacca tcaggacaat ataaatacta aggaatctta 61560 
gctcttagct tgattagaat ccaatagcaa acacgagtag tatgagcagc cttaggttcg 61620 
acctcaataa (tatattltg cttgtgcata attgcttctt gtlgaatatt ggtttttclc 61680 
gcatattata gaaattgtat atcggttagt cgtgaggcaa cgtatgcagc tttcaggagg 61740 
tgaaggttga Icaagattgt atcaagaata atgactattc taagcaggca agtcatcact 61800 
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attccttgaa catgttgatc ctaattgcga 
caatgatgaa catcctactt gtgattatgc 
ccttgtaacc atgattacgl atgagtccct 
attctagaat catgcatact catatttatc 
ggaaggtaat tgagatgcgg catgtggaga 
acatgatttg tgaaaggaga aataaaatta 
gatttgggtg gtatctggaa aaggctagta 
gaagttaagc atgaaacgac ccccgtacaa 
ggagtagata gctgagcgga ggcagtatcc 
ggggctctac ggtggggcag ccattggtag 
gccatcggta ggactgcca! gtgagaatct 
agtcttccct tcccgggtgc gccagaactc 
agtgcatgag gatgaaaagt tcatggagcg 
tctgccgtga cgcatctcat gtgttgggac 
cgggtaaagt gtacatccac tgcagtgtga 
cgcggttatt gagcaccggg acatgtatta 
tggggaatgg gatattgcat gatgaatttt 
gaaggtggac atcctcagaa aaccatgacg 
acaatggatg gtggacagaa ccgtcgttgt 
cgatctatgc taggttttag gcttgtgaaa 
acctgaagcc attccttgaa ataccctcta 
gagtacggtt ggtactcacc cttgctattt 
gcccttgtcg gtacgcttgc gtatcccaca 
ctcgtttccc cagtcgactg cctgtggcat 
tcgctgttgt tctctgatag ttgttggcct 
ttagcctaaa ttcattcgtg atatgttgtg 
gatccaggga ttggtacgga taaacacaga 
cacctgaccc cctcaggggg ggggggtcgg 
tgtctccccg aggggtcgga ccgctcccgt 



aattattttg tttacaaata aaattgcatg 61860 
catgccttga ttattgttta cccttaaaat 61920 
agtcaattat gacaattgct tagagatgct 61980 
aaatgctata tgcttgggca attacctttg 62040 
calgaacgcc acattgccat gatattaatg 62100 
aacaactgtt ttcgactggg gcggacggag 62160 
ccgtccccgg tcaattaagg accgagccat 62220 
ccgcacttct cgtatgggta tagacctagc 62280 
atgcatagtg gtttcttgat gtgtgaggca 62340 
gaccgcaagg cgggtatcta cagtggtgtc 62400 
aaaacataat tataacttaa tgcatgtgtg 62460 
ctctcactgc tagaaaccgt gtacgcctag 62520 
ggtactgcca atgcgaggtt atcgaaaagc 62580 
gaggctcatg tgttgggcag tcgcggagtg 62640 
gtaaaccaaa tctattcgaa tagccgtgct 62700 
cacttggcta gactctaaat tcttaacttg 62760 
atgctgatgg agccacatcc cgagaggagg 62820 
attcaatggc gggaagctat ccttgggatc 62880 
ttaaagtgaa cactggtact aaaatttgat 62940 
agaattgtaa aattagcttt atgcaaaagg 63000 
tcatatgcat tgttattatg gtggcttgct 63060 
atatatcttt taggagagtg ttgaagagaa 63120 
agatgatcgg agtgcggtct tgttctaggt 63180 
gttaaccggg cccttatatt attttgtctt 63240 
acctggccct aatgtaagta tttaactctt 63300 
atccaactat gtatgtgtgt accaactact 63360 
agatttccga tttccaaaat cgggggtcta 63420 
gcccgagggt gatgtggccg cccccctctt 63480 
ttctgccccg agggctgagg cgccccgacc 63540 . 
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ccttgtgggt tttgcgccgc gtgtatgggt taggtgagca caacggggct cacctaaccg 63600 
tatttattgt ggtttggacg agcgcgtcac gccgcatgta gcgcagtgca gcgcgctcgt 63660 
ttatccggtc tgtgaccagt cacagaccgg tcagatcgtg ggttaggtgg caacaggcgg 63720 
tctgacacac gccicgcccc atcccgtcag gataagagcc tccaggcact tgtccctagc 63780 
ccggagccag catgciaact cctggagatg acacgttggt cccggtcaga tatatgccag 63840 
gcttcatccc aaccattaca agcaagatat tgtatgaaga agggcgaaca tgcagaitgc 63900 
tggactgaca cgtggtggac aagaatgacc gatttgtgac cggtctgaca ctggtcatgt 63960 
cgtcggcaga caaccatgtt cccacgttgc acctgctttc ggcggagtgg aggtaggtat 64020 
gggccatccc atcagaaggt cgttcggaca gcagccattg caagtctccg cccatttatg 64080 
aagagatgac agggtgatcc cctggagaga aaaaaaggag gaccttgccc acttaggagg 64140 
tgaggacgac tggaagggga gaggatctgg agagtagatc ccacgagagg aaaaaaggga 64200 
gaagagggtt tctagagtaa gagctctctg actctccagc tctttgtagc ttcttcgtac 64260 
acagatccac cagaaaatag gagtagggta ttacgcttct cagcggcccg aacctgtata 64320 
catcgcccgt gtcttgtgct tttttcattc tcgcgaactt tccacagact aggagcttag 64380 
aatctcgccc agggcccccg gccgaaccgg caaagggggg cctgcgcggt ctcccggtga 64440 
ggagccccac gctccgtcaa ctttggctta taaitaaaaa tactctaagg atattttttt 64500 
atattttatt ttcttatgtc tatatgaaat tttaaataag atagatggtt aaacatatat 64560 
tggaaaaaca tatatccaaa agtccactat cacaagcgta gcatagatac gattacaala 64620 
cgtttccgcg aagactgttt atacctactc tat tccctgt tccttgtgcg gttgtgccat 64680 
ttggggctgt tttttcatct cggattaact cgcgtggaaa ccgcgagacg aatgttttga 64740 
gcctaattaa tccgtcatta gcalatatgg gttattatag cacttatggc taatcatggc 64800 
ctaattagac ttaaaagatt cgtctcatga tttacatgca aactatgcaa ttagtttttc 64860 
tttttatcta tatttaatgc Ucatatatg tgtccaaaga tttgatgcga tgttctggga 64920 
aaatcttttt ttaactaaac atgcccaagg tgtttctcca attaagttga cccaaaatca 64980 
ttcggcgtca cctttgtctt tcactttcct tccactacaa ggtgatgaca ctgacaaaag 65040 
gtccaaaagc tacaggatct gattttlgtt catccatctg tgatgtgtcg gcaagccatc 65100 
catggagttc atccactcaa ctcctctctc tcagagagag agagagagag agagacagac 65160 
agacacatgc atgatagatt gtgctaglac ggtagtaaca ttttattgcc tccttttcta 65220 
aaattctagg ttgtttggaa aacaaaaatt ctagattgtt caataaatta ataalattag 65280 
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gtatttattt taagtcactt taggtgttaa 
ttcgacgcat ctgagagcag gtacaatagc 
agtagataaa agaggaaaaa taagagtagc 
gagctccaag atacatatgt gtatgacatg 
tttatatgta tctattgtat gaattggcta 
ggtgttggga accatctccc aagcacggaa 
taagtattag ctatttttta aaaaaataaa 
tagaaacttt ttgcaaaaac tcaccgttta 
agggagaggg gttgggaacc tcctcatccg 
agttggctat aatattaaac ttgctctgag 
catatgcaga aatgtttata tttgtgatgt 
gaaatggagg aagtactacc taaaacaagt 
tagacttaaa aataattgga attactagca 
aaagcacagg ggacagcaga cacgttaaca 
tagggggccc tcaagtctcc cctaaagcca 
aaaagggtga agcaaaactt tattctctct 
tgggacctct agctaatctc attattgtag 
agaaataagt caataaaaat tactactaat 
gatttaccac atttgacgaa acggactatt 
agaaaaaaaa aactttcctc tattagccac 
tgttaagttt atttatcaaa atatttaatt 
aaacaatcac gcagtaattc gtttcaaact 
gaataacaat agcaacagta gagttagttt 
attttaaact tagccttaga gttaattttt 
gcattagttt cttttgtcac taaaaattgt 
ggtttatcag cagtagagcg aagccattct 
gctctagctc cactcttlct ggagctggag 
ttaggagtgt agttgggtgg aactctctca 
tccacaactt cactccaaac ccaactcctg 



tttttgaatt ttaaactgct taaactctct 65340 
agactataag ccagctataa atatatttta 65400 
gggctataga tttgtagaca gctgcagcgc 65460 
tgagaccaaa cattaattat gtagtatatg 65520 
ttaaattgac tatgggtgtg ttcggaggtg 65580 
aacggagcgg tccattatgg cgtgattaat 65640 
tcaatatgat ttttttaaac aacttttgta 65700 
gtaglttgaa aagcgtgcgc gcggaatatg 65760 
aacgcagcct atacatgatt tggagccaat 65820 
tggctcttga atcatcgaag tgatagaaat 65880 
aaaatttgaa tctaaaatta tttatatttt 65940 
atgagaaaga gacatgaaaa acacaaaatc 66000 
ggagglcgaa gtcaatcaag acggcgaaga 66060 
cgtaagtaaa caaacaagtg gttaattaat 66120 
ctaaacatga caggtttgtg taccatggaa 66180 
ctcatlagat taccagttgg aaagcaatcc 66240 
aacaacgttt tcttagagag agagagagag 66300 
ccacttgaac cagttctgtc ggtgtcggat 66360 
tattcgacgt ttcgaaaaac acactttttt 66420 
tcgttttagt tatataccta tccgagtatc 66480 
tatctctala attaaatata caatccgtaa 66540 
gagcctcagc tagaaaatca aaatggaaat 66600 
ttcggcttat catccgcaac ccaaatgcga 66660 
aaggcttgtt taccatactt cattttccca 66720 
ttttttaagt tgtttcgttc attttctcac 66780 
tggagcctgt ttggcacagc tctagclcca 66840 
ctcagcccaa cagttttagg tgcaccaaaa 66900 
caaaaaattg tggagctgga tttagacagc 66960 
aagttaaatt gataagttga agctctatct 67020 



EP 1 437 409 A1 



atcaagccct ttttcttgat catgcttcta 
aggaattgga aaggaaaggc gtatatgcat 
catcaaccat cataatcatc atcatctcgc 
tcacggacaa iccaagcgaa caccgctacc 
tcttgatacc actcctacat aaacactacg 
aagtaacatc taatcacgtc ctggttagcc 
atagcagcag gatataagcc agctataaaa 
ctatagccag ctgtagcatg gactlcaaga 
taataatagt gtagtatagt aagtaaciat 
tai tagtgtg ctatagtat t aaacttgctc 
cagctataaa catattataa tgagataaac 
ctgtagccag ctacaacacg gactccaaga 
tattagtagt atagtaagca actattaiat 
ttggagttag tagtgggcta tactattaaa 
atcacattag aggagtagct ttgagacaaa 
ggtcgtacag tgatgggccg ggccgggccc 
atgtcagcct cagccctgga cgggagctag 
cattcccatc atttcgcccc tcctccgggc 
tactgcgacc gcgagtccag cgagcctaga 
gagctactcc ccaatacggg gggattgcgt 
tttgttcgtg gctttgggtt ggaaaggtga 
attagtataa alcatttttg ggtactactc 
gtactccaac cgtccgtctt atttaaaaaa 
acataaaata ttaatcatgt tttatcatct 
ttcatataaa acggacagtc aaacattgtc 
agaccaaggg agtatctacg aacaaagata 
tgattagccc ggccgtttga claacctcac 
aactccatca (ttcggatac tlagagcatg 
catattltaa tgggataaaa gatgagagag 



cctactccat ttttgtttct tggcccicac 67080 
caatgcatgc atgcgcacat caacctcgtc 67140 
cagctgacga aaatgacctg catccatcca 67200 
aacatcacag ccaacctgtt tatcactagc 67260 
cgcaggttaa ttaattaagc gtgattactg 67320 
tttaataaga caacagttag agcaggtaca 67380 
aaagagagaa aagagcaacg ggctacagat 67440 
cacaacgtgt gtataacagg Igggaccaga 67500 
tataUUtt gactatagat gattlggagc 67560 
atagagcagg tacaatagta ggatattagc 67620 
attgatagag aagagcagcg ggctacagat 67680 
cacaacgagt gtatgacaga tgggaccaga 67740 
aaattaacta ttacattggc tatagatgat 67800 
ctttttctct tagcaaaaat caagcgccta 67860 
ccaattagcg gcgaatcaag cgatctgcgt 67920 
acagcccgac agtgacaggg ggcctgacgc 67980 
ccgttgtgtc cccgggggag gggagggggg 68040 
ccacatctca gtgggggtaa aggtgtaaat 68100 
tttggacctt gtgtccgttt gactgaaccg 68160 
tgtgtgcatg ccatgtgggc ccgagcgccc 68220 
ccgtgtgagc tgtgcggtgt tgtactacgt 68280 
cctccgtcca aagcttattt ataatttgtt 68340 
aatataaaaa aaattaaaaa aataagtcac 68400 
aacaataaaa aatactaatt ataaaaaaat 68460 
acgaaaatct aatgtttgcc ttttttttta 68520 
atacatgtta taatcatgaa gcccatgatg 68580 
gagctacgtg gctgacaagt ttaacttgtt 68640 
tacaatagca gactattagc cagctataaa 68700 
aagagcagcg ggctacagat ttatagccag 68760 
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ctgcagcacg gactccaaga cgcaatatgt gtatgacagg taagaccata tgttaatagt 68820 
atagtaagca actattttat aaactggcta ttagatcggc tatagataaa ttggagctag 68880 
tagtggacta tactattcaa cttgctctla tatgatataa atattgatat aactatatga 68940 
ttttgttaat gacatgtttg tttatggatg gactatgtgg ggtcggtcgc ctccgtagct 69000 
gaccaaaata caaacttaaa acccctatcl ataaaaatct aacttttgtt tataaatata 69060 
gatataaaag (tcataatta gagcctcatc littaaacga aaagagtact atgaaaacaa 69120 
ctcgtaatac aaagactaat tacgacgaaa agaaaatagt actgacaaga ggaaagcagt 69180 
gaacttgcat actccctccg taaaaaaaac caacctagac acggatataa cac ( at at at 69240 
ctagattcgt tcgttgtaat gaagtgtcac ctccgtatct aggttggttt tttcgtacga 69300 
aagaagtatg agUaatcta aagctatgta tacccttcgt caaaaaaaaa aagtaaacct 69360 
tgtactggtg cgtgtcacat cctaatataa tattgttttt tatggagggt gtacagttga 69420 
aaaaaattga tgtgttttaa ggatgaaaaa tattggtaat gttggctatg taactctaga 69480 
aaaaaaaatg cagtaataat aaaatgctaa tttgctggag taclagatta tagacaatcc 69540 
agtccaggac acgacaccct ccctactctc tccacttcca ctctcaccgg ccaccgcgcg 69600 
ctclctctct ctctctcccc cttctcccgc aagattcttc ccccaaatcc cacccgatcc 69660 
accgccgccg cccgctcgcc ggagtcccat cgctgccacc gccgccggag ccgcggcccg 69720 
acgcccgccg ggcctgcttg ctgtgtgtgt gaggaggtgg agttgctcgc gctcgttccc 69780 
gcggccacct ccgcctgctg ctgcttctgc ttccgctggc attgcgggga ggtcgtgtgc 69840 
cgggggacgt gggggctcgt gttggagcgc ggctgccggl gaggtggggg gtgcggcgcg 69900 
gcgcggctcg cgctcgtgcg ccggtggcgc gggcgcgggg ggaagcgtac gggggagggg 69960 
gagtgtggcg gcggcggcgc gcggggtagg gacgggcg'cc gccaccacca ccggctcgtt 70020 
cgctggcagg cgctacgcgt ccagatccgt acgccggtat gcttcgtctc gccgcaactc 70080 
tctccatttg attagtatcc cctcgccgaa acgaggcctg tgaggcgccc gctttctggc 70140 
tggcttccct gtactcgctg cttgctcctg cctgttgggt taacccgttt ccatcgaatt 70200 
tgggtaagcg aaacatcgcc tcatatgggc atttggggtt ctggcagcct taggctcgcc 70260 
atccgtcgcc gagcttccaa gtgaccggcg cttgttggta talttgcttg cttgttcctg 70320 
tttggtggct gcgctaaatc ttttgtgctg cattgaattt atgccaccca tatacagcaa 70380 
attactgagc tgaaataatt cggctaalta ggtccagcaa talgacatct cgtggattga 70440 
atgctaagct gacattgtat cactgatgct ggcttatata taggttgttg agaagtgaag 70500 
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atgtcgacag gtgaaacccl gcgtgcagag 
aggctatgga ttgtgattgg aatcagtatt 
atgtgcctct ggtccatata ccgaaggaag 
tctcaaatcc cggatgtttc caaggagatt 
gtcgaaaact tccgtgtgca agaaagccac 
aaagattcag ggaaaatgct ggcacacttg 
ttgagccaat gcagctcggt gtaccaatgt 
gaaggcagct cgggcaatgc taggaggcac 
cctctggttg gtctcccaga attctctcat 
. gatttggagc atgcaacaaa tcggttttcc 
ggggtagttt accgtggtcg ac tea taaat 
aataatatgt aagagatcct gaaatctatt 
gatgecatea tattaatttt cttttgatat 
tcagggttga agttgaggct attggccacg 
gatattgtgt tgagggaatc cacaggtaaa 
ggctagcttt tgtttctact ggcacattat 
ggtctatgtc cacctactca ccagattatc 
ettatgetat cgattttttc ccttccaaat 
agattatget cagtccacag aaaccttctt 
ggecataget ccgaaagcta ggattcacta 
ttggctataa caaggtctta ctgaaatgta 
gttcttgttc ttcacgtttc ttctcatgea 
gtccaagcta ctgtattctc caaagaaaat 
ttttccgcca ttttggatac tgctctagaa 
ttggccagtg ctgcttaagt gaccatcgag 
gttgagtgtc aggtgttttc ggataatctl 
tggtgtctgc ttcacgtaat tcaaaatcca 
ttgaaaatat gattatgttt ctcccttgtt 
gctacatctt gtaataattt gtggttttca 



ctatcatcca ggacgccgcc tttcggttlg 70560 
tgggtggtga tcttctttat actaggtttc 70620 
ccgaagaagt cctttgataa gattccagta 70680 
gcagtagatg aagttcgtga gcatgctgtt 70740 
gegatategg tgcaggagaa acattacgag 70800 
gttaggagta aatcgagtga tgccgataat 70860 
gatagggctg gtagctcgta ttctggtgat 70920 
ttttctcaat atgcaactgt ctcagcatcc 70980 
ctgggctggg gtcattggtt taclctgaga 71040 
aaggagaatg tcattggaga gggtggatat 71 1 00 
ggaactgacg tegcaataaa gaagcttctt 71160 
ctgcgtttta cagaacttgt gactccttct 71220 
ggtgctgcag gggecaggea gaaaaggagt 71280 
tcaggcataa gaatcttgtc cgccttctag 71340 
gctatttatc aatcaccttt gctgatggat 71400 
ttacttgeat agggatgtag gattgetett 71460 
tcaagggata ggttattcct gactgcaclc 71520 
ctgatgglgg gattcagcat gcccaglgac 71580 
tggaccacca ttcttttacc atgaaaatgt 71640 
gaagegcaca actgettatt ggtttgttag 71700 
cttccatagt tcattacttt gtgaatgect 71760 
tgttcaattc taaatttgta ttcatgatat 71820 
cagaagtcca ttcacctatg tattttccag 71880 
acaagttaat aatatagata tttatatggt 71940 
atagaaatlg ctlaagaaat atactaagat 72000 
gttaccaaca aataggtcct atgaatataa 72060 
cactcagcca aaataatctg caatagggtg 72120 
ttcatcatga ctacagaaat gaacaatgtt 72180 
attgaacaaa acatccatca aatgalatct 72240 
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acagcaatat attttgcact tctgagcaca 
cattgattta agctttttat ttcactacat 
ctgtttccac aagtacttta tgttaatggt 
ttactgttgt ttaatatttt tagtggttag 
tactatgttt Itgtaglcta ttaaacttcc 
atgcttgtat acgaatatgt gaataacggg 
cgccaacatg gtgttcttac ctgggaagcc 
gcgtaagaaa caaaccatcg tccccgtcaa 
ttttatatgt atatgtttag ttgcataacc 
ttgcttattt acatgaagca atagagccaa 
acatactaat cgatgaagaa ttcaatggca 
tgggtgcagg gaagagccat atcacaactc 
tttttttgga gttagtatta atctttccta 
act tcgc 1 1 a ttcatacagt ataaaatttt 
ataacaggta gctttctcat tgctatcatt 
tgccaacaca ggtctgttaa acgagaagag 
ggaagcagtg actggtagag atccagttga 
atatcctaca atctcatgcg tattatgtat 
tttacggcaa aataacgtct atactaggag 
tggcagttgl ctactgtcta gttacccttg 
tcaggagctg actagctgta taccttgtca 
gcaggtgcat ctagtggagt ggctcaaaat 
agttgaccct gacatggagg tcaaaccgac 
ggcactgagg tgcgtcgacc cagactctga 
gatgctcgag gcagaagatg tcccatcccg 
ataacattca tcatat (ata (cattgcaat 
aaggaccgga ggagccggag gggcaacact 
tcaagcgaat tcgagataag tggcgataga 
ctctaagaag acggtgatca tagtcaagaa 



caataggttt gagtglattc gagtcatggt 72300 
aaccattgat Itgagtgtat ctaaggagtt 72360 
gtctccttat gctttggcca tccaaactca 72420 
tggtgtccaa atctttcttt gtgtacatca 72480 
atcctatcat ctgacttgtt atattccagg 72540 
aacttagaac agtggcttca tggtgccalg 72600 
cgaatgaaag ttgttcttgg aaltgctaaa 72660 
aaagaaaaga attgttcttc actttagctc 72720 
cattttccat aactgaattg gtatacaggc 72780 
aagttgtaca ccgggatatc aaatcaagca 72840 
aactttctga ttttggcttg gctaagatgc 72900 
gagttatggg aacttttggg tatgttgata 72960 
tgcttagctt ttactgttgg aatgtgcagt 73020 
acatgctgcg aactttgtcc ttcgtatatt 73080 
gat teat t tc aggtatgtgg cccctgagta 73I40 
tgatgtctac agttttggtg tgctattact 73200 
ttatggccgg ectgetaatg aggtgagcat 73260 
gltacaaaag teegtactat tggaaattat 73320 
agacgaattt gcttcaggtg tatggctgtc 73380 
tctcactttt acagtctatt gttttatttt 73440 
tatataacaa cactgtaacg tggatgeett 73500 
gatggttggc acaagaagag ctgaagaggt 73560 
catteggget ettaagegtg ctctcctagt 73620 
gaaaagacct actatgggtc atgttgttcg 73680 
tgaggtggta acgctttctc ctttcctgca 73740 
aaatctgaag cttttgctgl aatcctactg 73800 
gecaatgeag ataccgagtc caagacaagc 73860 
agggactcag ggccatcagc aaggtttcaa 73920 
caatggcttc aaaactctat gcagtaacat 73980 
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ggtggttggc agagaaaaag gggtatttct 
gcatggcggt agagactgga gagagcacag 
attcccattc agtattctgt ttgttagtca 
ttttttcatt cttctggttt ttttgtttag 
gtctcttttc tagaatcacc aacatggaac 
tgctttcttc ttgctgagat ctatcatgta 
tggacaagta gatgctcgtt ctgtalgtcg 
ccatggctgc agccggcttc tgtttagttg 
gtggcccatt cttgatctlg ggccgaaact 
gggattgttc ggcccgttgt agatgggccg 
ttcggttggg ccggtggtcc gctactlcat 
tccaatagaa tccaaacatt attggattga 
caggttcaga aaattttaaa ctgaatttta 
tctgtttttt taggtttgtc aacggattca 
atttttgatc cggtatcgag attgtgaacc 
agtgaagcgc cgtgcgcgcc gcgtgcgcgc 
aagctggcaa gcggcgcgcc catgacggcg 
gtcaacgcac gcgaccggcc gagatccgtc 
gcctcacacc ccccacacac cgagtcatcg 
ggatatagcc atataitata gatggcgatt 
gcggcggcgc tgctcttctc ctctccggcc 
cttcttcttt ttttctttct tttttttttt 
gatcactcac acatggatac actgtcgtga 
cttgttttgg atttttcttt ttatgtgtat 
atgattccct ggatccaaac ggcaacatca 
ctgatggcta tgctgtaagt agcggtggca 
ctcaacctgt acatttacac tatcttgttc 
gttttaaaat ctattaagtt ctagagatta 
actattaagt agtatatttt aiaagltata 



ggagggcatt gcattttgta ttgtaggtct 74040 
tgtctgatga tggatacccg gagacctgta 74100 
agcagcttgt acagatcgtt gtctgttcca 74160 
gaggctcttg gattaccagt acgaaccgct 74220 
ctatcaatat ttactactag tactacgact 74280 
ctgtacataa ctgacgtgtt cagctgcaci 74340 
aatttacttg atgaggtcga gcattaagta 74400 
tgctgacatg cggcggcgac ctcacgctgt 74460 
gtagcaacgg gcgtacggcc catclatatc 74520 
gatcgggatt gcgacttacg Igcgacccat 74580 
ctagcagtgg tcggcggcag ggttcacaat 74640 
gttaaaaaca caaaccaatc ggctttttgt 74700 
attttttgac aaaaatctat ttagatttcg 74760 
gcgaaatccg atgatatcgc tcgtgagtgg 74820 
cttgtcgcgc attgcctgac aaagacaacc 74880 
cgcgtgacgc gaagatgcgc aggaaggaac 74940 
gcggcgacga cgacccgcgc gcgtgcgtgc 75000 
agtggccgcg gctatatata atacatcgtc 75060 
ctcgccggag ttagagttcg tagcggcgaa 75120 
ggtgttggtg gctgctgcgc cgtgctgctc 75180 
accacalgta agcacgccca tcttcttctt 75240 
tttttggaaa tgagccgcag ctgacaaaaa 75300 
cactaaccaa tgcctaagcc attttgtttt 75360 
cacttttgcl tgttgctctt gcagatgctt 75420 
cgataaaatg ggatgtgatg caatggactc 75480 
gtacaccaac atctctacct ttattttcgt 75540 
tactacctct aataaaaaaa tatatttgat 75600 
ggaaagctac acatggtttt atgttttgat 75660 
ttgaaggclg gggtttcaaa aglttgacta 75720 
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cactagatct (attcaaagc gtctaatgat tactgaacgg aggaagtatg aacltataga 75780 
cttgaagtta aacagcatag ccacatctct tcatgtatac ttcatccgtt tcatattata 75840 
agattttcta gcattatcca tattcatata tglgcgtcta gattcattaa fate tatatg 75900 
aattgggcaa tgctataaaa tcttataacc tgagaaaegg agggagtatg tegcaaacaa 75960 
caacaacaat aacaacgagc aaaatctgta tegaatcegg tttccctctt gtaactgtat 76020 
caaagatctg tcctctgaaa cgtcccctgt tcatcaggcc gttgtcacac Igtccaacta 76080 
ccagcaattc cggcacatcc agccaccggg gtggcagctg gggtggacat ggcagcagaa 76140 
ggaggtgatc tggtccatgt acggcgcgca ggecatcgag cagggegact gctccatgtc 76200 
caaggagggc agcaatgtcc cccacagctg caagaagcat cccaccgtcg tcgacctcct 76260 
cccgggcacc ccaatcgacc tgeagatege caactgctgc aaggctggat cactgagcgc 76320 
attcagecag gacccggcaa attctgccgc gtcgtttcag ate 76363 

<210> 28 
<211> 53905 
<212> DNA 

<213> Orza sativa Asominori 
<400> 28 

gatcagtgag tgagagtgat gtgctattga ttttegtcta ggattttget gtgetcttet 60 

tcttcttctc ctctctacca agaaagatcg atggaggaga atttgtagga cgcgtttctc 120 

acgaattact tagctgttaa tgatcagctt gatgtgtacg atatgatggt gcagagtgaa 180 

agltgtgttg ttcactggtg gatcatggga tgggaatatg ggattgttgl aagatgtaac 240 

tcaagtgtlt tcttttttgg gattactttt ggtaataaga gcttgggtga (cgaaaacla 300 

cagatggttt ttcttttaag ttgtatgatc tctgtagagt ttttgagtaa tttgtagtlt 360 

tgtaccctat caaagatcat ctctagctgc ctctgagctc tccaactcta tatgtccatc 420 

tctagtalat atgtcccata tttctgactg aaaatlttca agtcggltgg ttccctccgc 480 

ctggatattc tttcagctaa tlagattttt tttaaatgal aaatttgeta aaagcttgtt 540 

caaattcagc taagatctat tcaaacttca atttctctat cgaaatlccc ggaaatttca 600 

attcaatcat tccccaatac atgccgattt ccglaatatt gaaccatgac atgtaaacaa 660 

cgaaggaatc aagggcatat ttaglttcat ctcacatcga atataeggae acacatttga 720 
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agtattaaat gcactctaat aacaaaacaa 
gaatctatta agcctaatta atacatcatt 
aactcatgac gcaattaggc (taaaagatt 
(tattatttt ttctacgtlt aatactttat 
tgaaaatttt tatttggaac taaataggcc 
atccctctcc acctccctgc acaaacagtg 
agcagctccc ggaaacagaa agcaatcaag 
cagcagacgg cagaaaccag tgggcgcagg 
gccaaaatac tggccatcgg gtgcctacat 
gtgtgtgcta cactgaccgc cgctgctcgt 
aatttctaat ggtttatttg ggggtttaga 
tgaagtttgg gcgtgtcaca tcgggtatta 
taataaaaat actaattatt gatcctatac 
ccaaaacttt acatccctga atctaaacac 
taattttgat ttgggaagaa ggtgagtgag 
aatttgccag cagtaaataa agaaactact 
cttctttgcc ctaaactggc ctccatttta 
aaaggcaaaa agaaagaaat tgtagaggtt 
aactctctat gcagctgtgg tctgtggagc 
cgagggtgaa gaaaacgagc gtttgaccag 
taacctgacc gagaggttga agaagatgca 
tcccagcaac aacgggttgg ttctcactac 
agtacatcct tttgtccatt atgataaaag 
caaactgttt tttaattttc tatataaaaa 
tggagtttta aaaaactcaa ttaatcalat 
ctaaaatatc ttcatcttca tttaggttat 
tagcatgttt ttcaaactgt tttttaattt 
aaaatttact tttagggttt ataaaagtaa 
ttttacgtgg actaaaatat cttcatcttc 



attacagatt ccgccagaaa actacgagac 780 
agcaaatgtt tactatagca ccacattgtc 840 
cgtctcgcag tttcctgacg aaccgtgtaa 900 
gtatgtgccc aaatattcaa tgtgacaacg 960 
ctaatattct ttcaagatat tagaatagtt 1020 
aacttctttc tccttgggca caggagtagt 1080 
caaagtcctg aacctgaagc atcctgaaac 1140 
cgatagcagl ttttcgtggt ccggcgtaca 1200 
agaatgagtc cactggacgc agctaccacc 1260 
cgaccagttg tacggggctg acttattctg 1320 
acactgaggg gtgctttaga tccaaagatg 1380 
tatatagtgt cgcacagggt gtttgggcac 1440 
gataagctat ataatactcg atgtgacacg 1500 
ccttttaaat agagtatttg gtgtgaaata 1560 
atttggaaaa aaaaagcatt tcaattaaaa 1620 
cggttttgta attaaagtga ggttttggca 1680 
taaagtgaga accglgcagc aaaagcctga 1740 
tttcaggagg atacaactag gtgggtctct 1800 
aaaacgatga aatggaagac gggacgttga 1860 
cgtcaaccat ggcgtgaaca gtagcaccac 1920 
atcaacgggg tactatagtt cccacgaatt 1980 
tcacgaattc cctgtggctc aacaactact 2040 
ttctatctta atttttattt acacgttttt 2100 
atacttaaaa tafcaaataa aatctatttt 2160 
atattattga cttattttat tttacgtgga 2220 
gttcttttct catcaagata catgatacat 2280 
tgtatataaa cttactctaa aatatcaaat 2340 
aactcaatta atcattacta acttgtttca 2400 
atctaaggtg gtglttggat ccaaggacta 2460 
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aat 1 1 taatc cctatcacat cggatatttg 
atgatgaaac ccattccata accctggact 
taatccatga ttagcctatg tgatgctgta 
gcttaaaaaa tttatcttac gaattagctc 
tacgtttaat acttttaatt agtatacatc 
ctaaatcgaa aatcaccgaa tggctcgtca 
ccaacaatcc aacggctagg aagcgcccca 
caagtgggtc ccaccccttc cttccttttt 
ttggctagct agctagcttg gcccaacgcc 
gccgagctca gcgcctcagc tccccctcct 
tttcccccaa atccgcacgc ctctcccctc 
aaatccggat cagccgcagc cgcagcagca 
cccccccacg acgacgtcac ccacatcccc 
aaatctctct ctcctctctc ctatgcgcgc 
gaggcggagc agcagcagca gcagcagctg 
cccgctcctt ctccgacggg ggaggagggg 
cgcaagacgc gttcgaattc gacggggagg 
cgtcgcagtc gtcgcacccg cccgcgccgt 
acgaggaccc gccgccgccg ccccggcggc 
cggagcccgc cacggcggcg gcggcggcgg 
agtacggcga gatgatggag agcgtggacg 
ccaccgcgcc gaggcggglg cgccgggcca 
ccgcgccgcg ccgccgcgtc ctccgggccc 
tttttttctt acttgtctgc gctgtaagta 
ttcgcgggca atcttggtga ggaatcttgt 
gtttcttctg cgtaatcttg gcggaaacag 
cggcatttct gtgggaaatg aaccaccttt 
gtgttgcaat ccgtagctat tgccttcatc 
tttaccaact tgtgtttgtt ttcttgttct 



acactaatta gaagtattaa acatagatta 2520 
aattcgcgag acgaatatat tgagcataat 2580 
gtaaacatgt actaattacg gattaattaa 2640 
tcatttatac aattaattlt attgttagtt 2700 
cgacgtaaca ctgatcgata caaacaccaa 2760 
tcctcccaca tgagatgcca agatggaaca 2820 
tcccacccac cgcctaaccg ccttcctatg 2880 
tttttctttt tacaaatccc cttccctttc 2940 
acgagccgag ccgagcacat ccggagc;caa 3000 
cctcgtccca ttcccggttt cctcctccga 3060 
cgcctccatt tttcccgatt cccaattccc 3120 
aaaaatttcg aaatccaaat ccaaacccat 3180 
acccccgcga gacgagacga gacgactccc 3240 
cgccgccgcc gccgcagcag cagcagctag 3300 
agatgatcgt gcgcacctac ggccgcagat 3360 
agcgcggcgg cggcggtggg ttctcgtcgt 3420 
aggaggacga cctcgtcctg ctggggtcgt 3480 
cgcaggagtc gtcgtcgatg tgggacttcg 3540 
ggcgggggag gggtgggggt ggggactacg 3600 
cggcggccac ctcgctcatg gaggcggagg 3660 
aggcgaactt cgcgctcgac gggctgcgcg 3720 
gcttcctcgc gctgctcggg atctgcgcct 3780 
aggggtcggt acaccaaaga accctccttt 3840 
aagaataaca attcgcgttc ttgctcttgc 3900 
tagggttatg aaattgggca gccagttctt 3960 
tgggattttg tacgattatg gc.tccgtaat 4020 
agggcatttg accttcgaac agcatgcttg 4080 
Uaggcacaa gaacttgttc tgaattalga 4140 
gagttttctt gcttggttag ggttagggtt 4200 
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atcaccgtgg tggtgcagaa ttagafgttc 
tttggtaccg aglgttacag ctgggtttag 
ttggtctctt tattttgcta atctcacatg 
gatgaatgcc tagatactag aaatacatct 
ctatgtagat atgactgtag aatgttactg 
ctagcctaat atatctctcg aggccaagag 
tctatttggt acttaatctc ttttatttgt 
ttgctaagta gacaggtaaa gttctaccta 
agattaagga gtgccagcca gagccttagg 
acaatgttca caaaactttt gtggaatttc 
tgtgaataaa agaagcactt gatggaagtt 
gaggttgatt gcttgaactg ttatggactc 
ttatgctaat gtgcacaaat aaaattgctg 
agcacacttc ctatttgtat ccattitcct 
attgttccat tcaacaaaca catattccgt 
ttcttactca caagtgtcct cttttcttat 
tgcaattttg gttttgaaca ttgatgatcc 
atlcgttttg gcaagtgatg tgaglacctc 
t teat tc t at tttttacgea tatcgattgt 
tcacatcatg gcaaagttat ttatttctcc 
ccatgaatat cttggcatgt gattacaaag 
ttcatcattt tgtttgaagt tgaactttaa 
ageagctegt attctttaat ggcataactt 
cttgttgttc aggttcaaga aaatcatttg 
cttaaattat taaalcctcc agtgaatctt 
aaacttcttg gaatcagtaa agttcaaatg 
atttcagagg aaatcctttc aaaagttgaa 
tcgctcgaca aagatgacaa gaaaacaaca 
ttgttgacaa tggaaaaggc atgcttgtct 



gctacttgtc ttaacctctg ccttgcccaa 4260 
gaagtgtgat ctttgagcat ttctagcatg 4320 
gttgtagagg aaggaagcat agtgactgat 4380 
ttattaactg aattaggatt gcttgggtat 4440 
ctggaaatgc tatccaatat ccattgatct 4500 
atcagtcaat tttgaacttt caggagagtt 4560 
tacttttggt gcctggctct cttttcatga 4620 
aaattattct taaaagttca aaatcgcttt 4680 
cagagtctta taaaccaaaa geacaatget 4740 
cacttgagct gtataaacat cgcaatctac 4800 
catgttagca aatgacatgt tttctgtgag 4860 
ttgcaacttt ttattttact tcgtacccat 4920 
agagtaaaaa tgtacaactt gttacgcacc 4980 
gttgaatttc aaatgtattc aattgctgaa 5040 
laatgaaatt attatacatt gcgttttgtt 5100 
atcctataga ttggtgcaac aaattattga 5160 
tccctgcact atlggtgcag ctgctcttcl 5220 
tcaatcccat ccttgtgctt ctgtgcatgc 5280 
tttcttttat ataacagccc ataaaaataa 5340 
agtacagtta tataagtatt caccactttt 5400 
aagattattl aagaaagtcc atgettttat 5460 
tttatggtgt aaatttcagt taatattget 5520 
cacttgtgct taltctccaa tatctccctt 5580 
ttggattcag aalcttgtgt ccattttctt 5640 
gttgattcca aagcaccatc gataggttcc 5700 
cttaatggat caaataagga ttctgactgc 5760 
gagattctct taagctgtca agagatcaag 5820 
aggecagaac tgtgtccaaa gtggcttgcl 5880 
gctgttlcag tggagggtaa gttttaatca 5940 
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aatttcttgg tcatgattlc cctttatgac 
cagttgccat aagttacata gcacctgttt 
ttgcttcacc tgcctttgat tgatgacttc 
ttgactgcac tagaagcacc tatggccatt 
tatttgattg ttacagagac ttctgacact 
acattaaggg agttgggcgg tcttgatagt 
acaitggagg tgagatctcg ctaacatcgc 
aggatggtgc aagttttgtt cctttttgcc 
gaaagcaatg cttgtccaga tacatagcca 
gcttgaggta gtaactattt tcatcaggac 
tgtttcatgt tgcaaaatag ttttttaata 
ttacagtgat tcctctgatg attatatccc 
caagtggaca ttattcattt ggatgttact 
actttggagt aaaltgcgta tccctttaag 
actttttatg cccccaacta ataatgcaat 
ttaacaatat actictggtg gaaaccttgt 
ttcagctttg gacctaaatg aaggaacatc 
tttgaaaata ttggaaaatg ccacatttct 
atattctgtt tcagtttagt acccattttc 
ttctgtgcaa aatgtgcaaa cagtgtgact 
tcctgaaaag cagtataaac tcttacactc 
tatgagtaga aaattgtacc cgaaacgctc 
tattattgag ttattatcag gtatttttct 
aaatgtttta aacatccagt atgttaaagt 
age tc 1 1 tea ataclgcaga atlcttctgt 
taaagtctct caacagagtt gctctggtaa 
gttggctttt ctgtgcactg tttcaatata 
tgttgaactt gacctgacac ggtggcacca 
atatttcatg tttcctgacc cagattatct 



cattataatt atttttatga gccaaalaag 6000 
acaatattca tgggtggttt gcttagccct 6060 
catccgtgtt gcacaactga attggagtaa 6120 
gtcatactag gaaggttttc ccttatcaaa 6180 
gtgtccagag teggaggaaa ttttaaagag 6240 
atttttgacg ttatgatgga ttgecattea 6300 
atattttaca cttcctttgt tcaactctaa 6360 
attttagctt taatgtgctt gaagecacat 6420 
aaggttgtta tattttggga catggaaaat 6480 
atggaaaatt ggctgeatea caaattatgt 6540 
cttttttatt ctgcatgtgg tgttagtgtc 6600 
ccacgataat aatacttgac atatctacac 6660 
tttccagcta tacttgetgt tettgeataa 6720 
agataaactg cttggtgctc ctatctgtgt 6780 
catattaege tgataaactg aataaataaa 6840 
gtatcagaat ctcataaagg atacctcaac 6900 
tttgcaaagt gccgctctcc tcttgaaatg 6960 
aagegatgat aacaaggtaa tgttccttat 7020 
ttcttctgta ccatcttctc ccctcatttg 7080 
ttgtatttct gcttaacatt tttctttttt 7140 
attttgette ttgcagaccc atttgettaa 7200 
ctcgclttct tttgttggtg tcattatcag 7260 
taataataca atatgtcege taacacaata 7320 
tgcagtctga cgcctatttt gttttgctgc 7380 
tgtttccagc tctacatatc egaaategtc 7440 
taacaaacac caaatttgtt tgatcaactc 7500 
gtttggtcgc cattcaagtc tcactacaga 7560 
atatttataa aacgctacct gatattttta 7620 
tgttggttcc tcatataagt ttaattagtg 7680 
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tcgttcttga aactttgtta tgcagcagat 
aagcgcaaga actcgaagaa aaaaaacctt 
tcttcaaaat cagaagtttc tcatat tac t 
cagaaggcat tcaattgttc tccatctata 
ggcgagagac atagcaatgg tggtgctttg 
gcaaatccaa ttagaggctc aagtgggtgg 
aactccagag aaatggcaaa aagacgccgt 
ggcggtgatg acccttttgc ttttgalgat 
cttggtccaa aaaagaaatc gcctcagaaa 
gttgcaagtc atgaaccaga ccaacctgaa 
tttagtgcta aagatgaatc cagtcttttg 
attaaatatg tttccttctg atctttcttg 
gtcacagttt ctcggtttgt ctttgtgact 
atttccttlt caatatggct cctcaatcta 
aatagtattg taacatctcc tagaagaaat 
tatgagacgt gatcctgatt ggaggtttag 
ttgtatctac tatcttttag tacatgatat 
agtgcttaag tttacatagt tcagctaaca 
taagtgtaga aagaaggtac tttcaaaaga 
gactcatgct tgtlctgact gtgagcctaa 
tttlttatcg ttttatgaga tcttccaaac 
gctcctttct tggatatctg gtaaatggtt 
tcaaagaaaa tgtaacaact cctggaaaag 
ccgtattggc acacatctaa ttctgctgtt 
aatgacaacc catctggttg tgaattgatt 
tccttgatca tgaagcattt cccctcattt 
agagatgtca atcttgatca tgagttatca 
aaaattaagc aattgcgaga tcatgaactl 
gttaaccttg tagagaagga tagccttaat 



gtcatggggg gaacttcatt taatgatgga 7740 
ttgtcgaacc agacacgcca tagttgctta 7800 
atatcttctg gtagtgatgc tggtctgtca 7860 
tcaagcaatg gggcatcaag tggttcatta 7920 
aagttgaata taaaaaagga tcgtggcaat 7980 
atttcaataa gagcgcacag ttctgatggg 8040 
ctatctgaaa atgtaatcac cgacagtggt 8100 
gtlgatcagg agccttcaaa ttgggaactg 8160 
catcaagaca aatcaggaaa tggagtgcta 8220 
gatcttaatc agtcgggtac aacatctctt 8280 
gaagactgcc tcttggcatc agttaaggta 8340 
tttcttcttc aagagaatat acattcttgg 8400 
ttgttgagtg acatattttg aattcacaaa 8460 
tagcatctgt cgtgtatgta ttctgtacaa 8520 
tggcaccatc catatcatac agtagcaatt 8580 
gacagagcct cgagctaaat tgctattgla 8640 
gtgctgggca ctctgtgtct gagtgtagtg 8700 
tgcatatgta agacagttta tgattaaatt 8760 
tttttaagga caatataatt gtttcaccgg 8820 
tgttaccttt acatgccctt acattgtcla 8880 
aacttgatct gtcttaatgt ttttttgcta 8940 
aggccgaagt atgaactttg ccttattgtt 9000 
tctaattttg gttgcccttt attttgctga 9060 
cctttctggc aggttcttat gaacttagca 9120 
gcgtcatgtg gtggacttaa caccatggcc 9180 
tgttttgtcg tggacaacaa ctataacacg 9240 
tcttctcaaa acagcaaggc acaccaggtc 9300 
gattttctgg ttgccatatt gggcttgctt 9360 
aggtaagtcc ctcacatgct tccttccatt 9420 
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tgctcaattc atatcagtgt tactgttctg 
tccaattaat gttcatgttc tcttaacgac 
tacggctttc ate (gcccgt gttcclgttg 
cacagagaga lgtcatagca ctcctctgtt 
aagcttctgg aactatatca ceggtaatte 
(tatatcttt gtaaaactcg gcatttatta 
attttgetec ttagcatagg tcagcagaac 
catgtaacat actgttatct atgagatgaa 
aaatatttgc tatctgaatc cagtctacca 
tatttcaaag aataatttag tgcaccattt 
cctlctattg acategggtt agaagtgaaa 
caaacccaca aatttcatta acacatacag 
tccagaagct tgagttgaat ttattttgtt 
tttttUagt tcttggttat tgcaatttat 
gcattcatta tgacattcla tgaaccattt 
atgaggagtc tttgatgcaa ggagcacggg 
cagcccttct tcttgcgttt ctttcaactg 
ttttattaat gtctaactac ttgcalaatg 
tgttaactgc aacagcatga aggttcgtgg 
cttaaaaatc cttgtgcctg cgctagagaa 
tactgtttgi ataaaaaagt atggatgatc 
atttttcatg tctgaggtgt gaggtgtcac 
tttgcaaaat ttcacataaa taaggaaaat 
ggatgtttta tttctcaact ggagaaaaca 
ctagtgtgct cccactgtaa gattgtctgc 
tatcactacc cttgagatga gagacacaat 
cagttgaagt acagcaccaa ateaagecaa 
acaacagata gaggtaaaaa gggagtttct 
gcactccagg cacaaclctt tcttaacttg 



gcagttcctt ggggtcagga ctcagaaaca 9480 
tcagaaatac tttataacct ctccacaggg 9540 
atctatctca gaatccacag agtgaagaga 9600 
ctgtaltctt agcaaglcaa ggtgctagtg 9660 
aaaaitcttc aagttccttt tgtalgtaga 9720 
cctgctcttt gtttcaaaaa gcagtatttt 9780 
agttgatctt attcagaaaa caatattttg 9840 
aattaatgea tgtgtaataa tgtcaatgat 9900 
actctagtta gaccgaaatt actgaggttc 9960 
gttcaactac tatgaagtaa aatggtattc 10020 
ggecatctta atgcaatgtt ctcaatgcca 10080 
attatlatta acatagctat aaattggatt 10140 
acaatlgaaa gcactgggaa cattagcatt 10200 
aatgttatac agaactgtgt acctcacaal 10260 
gattgactgt tgcttgtaaa caacaggatg 10320 
aagctgaaat gatgategta gaggectatg 10380 
aaaggtttgc aatctgtagt tgatggattg 10440 
tcagcactat ggcatttaac ttatactgtc 10500 
agccalttcc agctgccttc caaataacag 10560 
atttgtggta tgtctccata attcttgaac 10620 
tttgaattta ctccattttg gaaatcatta 10680 
cataattgta cltcccatcc aggaagectg 10740 
ttgaacttgt ttcaagtttg aatagtaaca 10800 
ttccggclgg gacttttaac ccttaaaatg 10860 
tgtcacattt gaaactttgt gtaatacctt 10920 
ctggtaccga gttaagttat tgataactcc 10980 
catgttggct aegtaattaa atgttctctt 11040 
aagtatctaa cclcttaccc tcttggctta 11100 
cgatttagga cttgactctg agaataltgt 11160 
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gtgcccacac tggttgagtg catgcctatc 
actctgaagc tgcctgagct tattctgctt 
ttcagtcgtt ccatctgcag ctcaatatga 
aagttatcga gaaatgcaaa ctitcataga 
taacaacctc tttgcagcaa aaaagcalac 
gcagatggac tgatactctg acctgcagtg 
ttttttttta tgttttcccc tgttgttttt 
tctagcgatg ttattlttct tagcatgatg 
taagtgtaaa gtagcaacag catagggatg 
tattcagaga cgtccataca gtttgtacct 
tagaataaat cacatgltag caatgcaata 
atcaaattct gtgtagaagt gtatggttgg 
gttttgtgga agttaagaag tccctagttg 
gatgcaactc tgattgcagc atttcttttt 
gtttattgtt tagtactaca agatttggtt 
taaaatcttg gagtaacaag ttcataatac 
tgtatattgt ctcctttgtl tttaaactaa 
ttctgaaggt ttcaccggtt tcggcccgtg 
aacggtccct ccaacacata aatggttgag 
agtccaccac gtagacactc ataacaaaag 
tctatcttac ctttgatatc ggacatccaa 
tcatattaca cctgaacttt atattattca 
tgcatacatg ctgaaatagt ttgacaacgt 
agttgcaatt cat 1 1 tatcc cttttctttt 
aattcacatt acaatatagt ataaattggt 
tattcaaaac tagtcattta agctgccaaa 
aaatggcatl acaaaactta gggggUgaa 
gaatttctat caaaagtlta tgggtacata 
cttatcgtaa acgttaaata ttttttgtac 



taagctgcta gtttttgttc attttgatta 11220 
ccatcattta ttaatccatc atgtttctct 11280 
tcacagagga aacgcactca gctgtcacag 11340 
aagagtgaag aggggcctgt acagatcaac 11400 
acacaagtgt ttgtcttggc ctggggctct 11460 
ggcttgggag ctaacaatgg tttcattctt 11520 
gctcatgttt tgtgtaattt tttcttctca 11580 
ggagtagccc tccttttttt tttctctaat 11640 
aatgttcagt gtagtgtgtg gtgtttcagt 11700 
tgtgaccaca cgtcttaatc tgatgaagct 11760 
tcatctgcgt cttctctcac tttggtggcc 11820 
tgtgctgttg caaatgccgt attccgctct 11880 
aaataccgat ttttcatgat ctcggagatt 11940 
attagaatgt acactccatg ctatcatgat 12000 
aaccattatt ttaatatcat aataatltta 12060 
atgatagcat aactttttga ggctagtcta 12120 
gcactcaata aattattgat ggctgtaatt 12180 
ctttataaat agcttcggca caaaagacaa 12240 
tttacgtttt cattatcttt ggtaaaatca 12300 
tttgaatatc ctcagaaatt ttgacttgag 12360 
ccctccctcc ctccctgaac tttatattat 12420 
tattacaccc Igaagtggtt ttcatttaat 12480 
gagatgcaca aaatctacac gttcgtctta 12540 
tctctcttac ataggaatat caatagtacl 12600 
gatcgattat tggcaatata ctatattaaa 12660 
taaglaaacc aclatcgaaa accacaatat 12720 
tatccaattt taaagttcat galgclagag 12780 
tggacttttt cctttttaaa agaagctatt 12840 
tttatum atgattgaaa aaaaaactla 12900 
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gttttcaaaa tgattggtct gtalacaagc 
gtttcctggc agaaactgta atttgttttt 
tgtgtacgta tatccgatgt gacaaccaaa 
ttacagatat atttgatggg tgtaaagttt 
gtatacggac acacatttga agtattaaat 
ctgcctgtaa acaacgagac aaatttatta 
tactgcagca tcacattgtc aaatcatagc 
atttacatgc aaactgtgta attggttttt 
gtccaaatat (tgatgcgat ctttttggcc 
atttgctgaa tttttccaga cgtcacggct 
caccgacgcc ttggtttcca acgaatttta 
atcgccatcg gcggccaacg gcgaccgctc 
gccgccccct gatccaaagc ctccgccgcg 
ggaggaggag gaggcgtgag cccctatggg 
cgccgccggc gccggcgacg ccacgccgtc 
gaggcccccc ccccgccgct cgctgatctc 
gatttgggtg tttttttttt tccgcagcgg 
tggagtgcca gccgcatcgg gtgcgccgcc 
cgagctggag gaggcggagg gagaccgtgg 
claccatggg ggattcgccg caggcgctct 
gcaaaatgtg ttgctatgtt cctctcgctg 
ctggaacatt acattcggaa cgttgttggc 
gggagaaaaa tcgtttgaac ctgcagtgac 
ggtgtggaaa tcaaacataa tcattgccag 
ttgttggctt ataacagtta gctccacacc 
taaattgagg ataactggca gttgttaaga 
agctattgtg catttatgtt Icatggaatl 
gtataatgca aaacaaaaaa aaaaaaaact 
atgcaaataa aatcatgttc atggacaaac 



atcaattaga cttaataaat tcatctaaca 12960 
gttattagac tacgtttatt atttcaaatg 13020 
cccaaaaatt ttccctaact ccatgaggcc 13080 
tttaagttct ttgggtgcaa agtttttaaa 13140 
atagacaaat aacaaaacat attacatatt 13200 
agcctaatta atctgtcatt agcaaacgtt 13260 
gtaattaggc tcaaaaatat tcgtctcgta 13320 
tttttcgtca acatttaata ctccatgcat 13380 
aaattttgtt ggaatctaaa caaggatcaa 13440 
tgttcatcca tcgttcgcat cgcgattcgc 13500 
tcatccgctt aaatacatcc aaagctctcc 13560 
cgctclaccc aatccaccca tccactcgcc 13620 
ccgccgtcga gaggaggagg aggaggagga 13680 
gaccctcctc cggccgcgtc cgctcgccca 13740 
gaccgcgcac ggtagccacg cgcctctcga 13800 
tcttctcatc ctgtttgggt ttgggtttgt 13860 
tggtggtgag cggtggccgc ggecgtggcg 13920 
gcccgggtcc gcaggttgcg gtggcgacgg 13980 
tgagatcgga tttcgccgct ggtgglgccg 14040 
cagglttgca gcctcctcca ctctcttctc 14100 
ggctggcctc atagccatta atgtagtttg 14160 
aattgcttga caaaatgtgg aaltgtggag 14220 
aaaaltgcca tctataattt laaaactgaa 14280 
cacatcattc ttgttaacca ccttgacata 14340 
aacttggaag gtgtcaatgg aatgtaagta 14400 
ctttctacag aacttgtagc agctaaaact 14460 
tgagcggcaa tggatatttc Itactaagac 14520 
atgtctatgc agtttacatg taatgtgcgg 14580 
taatgggatt cataccaaat tccagaattg 14640 
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catttcttat gtggtiactt ttgtttgttg 
agggtcagag gggtttgctt ctacgcggtg 
gccatggttg tggttcatcc acttgtgctc 
cactacattg caaagatttg ggcaactctg 
gagggaatgg agaacctgcc accgaatagt 
agtttcttgg atatctatac ccttctaact 
acaagtatat ttatgttccc aattaitgga 
ttgcggcgta tggacagcag gagccagctg 
aagtagacat aUtacattt acagtatttg 
tgattttggg gaaaacacaa aactctcttt 
acagttatag caccttcttt aaaatgaaga 
cataatgaat tttgtltatt tctatctttg 
atcgccttcc tatatatttg tagtatgtca 
ggtttgttaa aattcaggac tgtcttaaac 
ctgtattttt ctttccagag gggactagaa 
ttcagtaacc aaacttaggt tacattacat 
gttaaccttc tcalggtgta ctgacgtggt 
gtggctacaa agaccggtgc tcctglgata 
atgccttctg gaatggaagg catccttaat 
ccaattgaag ggaatgatgc tgagaaatta 
actcttattc taaacggtta tggagtgcac 
tggaacctat tcaaaggcac agacaggctt 
c tagt tacta attactttcg taatcagtat 
acatttctgc acttggtaaa tttacagaag 
attcacccag tcatatagtt gaagaggcaa 
acctcttgtt cgattaaaaa tgaccaaaga 
tagatatggc ccattaattt gtttagttgt 
gttaccattt ctatgatcta aaacaatcaa 
tcagtacaca tgtatgaatc ttaatattct 



atttggttac cagacatcga tgtggtttca 14700 
actgcagttg cagcaatctt tttgtttgtc 14760 
ctatttgacc gataccggag gagagctcag 14820 
acaatttcca tgttctacaa gcttgacgtc 14880 
agccctgclg tctatgttgc gaaccatcag 14940 
ctaggaaggt gtttcaagtt tataagcaag 15000 
tgggcaatgt atctcttagg agtaattcct 15060 
gtatggctgt agtctcatcc ctgctttctt 15120 
gtaaataaac aagattttat gaatcatata 15180 
gttggctgcc ttgaacatag ttctgttcac 15240 
actttgttgc atacacataa ggccaaacca 15300 
aatgttagca tcgtttttgt ttaatgcatg 15360 
acattgtatt ccatgctgag cataacaaat 15420 
ggtgtgtgga tttggtgaaa aaaggagcat 15480 
gcaaagatgg aaagctaggt gcatttaagg 15540 
ctaatgagat ttttatattc agtatataat 15600 
tataaatgtc cccagagagg tgcattcagt 15660 
cctattactc ttctcgggac agggaaactg 15720 
tcaggHcag taaagctcat tattcaccat 1 5780 
tgttctgaag caaggaaggt gatagctgac 15840 
taaagaaaga tggtgttttt ttttattata 15900 
tcaaggctaa gcttgttaca ggtactgata 15960 
aaataagctt gtgtagtgta atggcattgl 16020 
aggcaagtaa tattttagag gattgagttt 16080 
gtaacctgta agagaggact gaacattaac 16140 
gcatcaaaca tgtattcgag gctgttactt 16200 
ctatgtacat cctagttggt gtaaatgcca 16260 
ctcttttagt alaltttcaa aaacgaaaat 16320 
tctctagctc gttacaaaag caacaaaggc 16380 
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accgtgtcag ctggttcaca ttagctagtt tgtactlagc at tatccact agcacctlat 1 6440 
tttcatgcat atcatgctaa tttgcttgcc cacgttgagt gggaattttt ttcatgtttt 16500 
ataatttata tatgtlttag acttctagtc cacaatttat gtacttcatg ttcctgagcc 16560 
tctagtatgg ctgatagcag actaggtgct gagtgctgtc cttttttgca gactgaagag 16620 
agaagaaata caagactgtc cattgttagt cagatttgta aaaatagact ctgalglagt 16680 
ttacttttgc ccctatttta tttttaacaa tacaaatata taacagatcc taagaactta 16740 
tcttaattta ggagaagttg ctcgtttcat taaattaaat tgtgaagtaa aaatgtgtgc 16800 
tcgagtctgt caalgcaatc ctgtgttctt gtltgaagat atggtgtagg gcaggccagg 16860 
attgaacact gaatggtaag actgcttctg ccttcagacg ttattgctaa atttttagct 16920 
acttgcagtt agtgctgcca cgccgattaa gcagtagaac aaagtagttt tgtcgtgcac 16980 
aaatgagtta tatttcattg gaaatcgaag cgaaaacgaa tcaaaagtla gaagaaaagg 17040 
ggaaacttgg taattactcc ataaagagag tgcattttat tggtaagatg gtatccggaa 17100 
gctgtgagct ccgggctgta tgtattctgg caaatttgat atgagatgct cgattattgg 17160 
cttaagttag cgalatcaaa tttggggaag caccaaagga attattgtga aggagttatg 17220 
ggtgcgtgac gttatctgct aggttcaaat ccttgtggct atgaatattt atctgctagg 17280 
ttcaaatcct agtgactatg aatattaatg ggtaaggtaa gggatttatt gtUatttta 17340 
gtttctttaa gattgtgcca tcggacgcca ttcggtaact gtaataatgc tttgtattgg 17400 
attcactigt gttacatgca cgcactaaac atgtgcttta ccttttcatc tgtttttgcg 17460 
ttctgggcta gaaactcaaa cgttgaattt tccatggtct gctcaacttg acaattactg 17520 
cgtgtcaagc gatcttatac gcatactatg cgcacaagtg attgtatacg gatatgatga 17580 
cagtataacg tgtgatattg atttttttaa taaaaaaatg atgttccttt ccltgatgaa 17640 
ggaacaaaga ctttttttaa aagaagggta ttaclaaaaa caaaaatgac aaaaacaaaa 17700 
tatcagtgca catggcaagt gtgctcggca attttttctc tgtactttaa acaaaaatac 17760 
ttctatatgt lcttttttat aagggtggca caaatctttt aaatgagcca aatatctaca 17820 
ttggatttat taaaaactgt ataaaltata atttatactc tgaaaggttg tgtgcatctc 17880 
tcttggagaa aatgtataag ttgcaaacaa acattaatcc acgltatgta actttttttc 17940 
gccggaaagg ccgaaggagg cctgacggag cgtggggctc ctcaccggga gaccgcgcag 18000 
gccccccttt gccggttcgg ccggggactc agggtgaaat tctaagctct ctgtatgtgg 18060 
aaggttcgcg accgtcgaaa gagcataaga cacgggcgat gtalacaggt tcgggccgct 18120 
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gagaagcgta ataccctact cctgtgtttt 
gtatgagcca gcctctccct tgttctgggt 
cccctctaag tgggcaaggt cctcctttta 
ccctcctttc tgtggggact taccctacct 
gccgtccgaa tgaccttctg ataggacggc 
ggtgcgacgt gggattatgg ctgtctgctg 
attgctcatt cclgtccacc acgcgtcagt 
acaacatctt gcctgtaatg gttaggatga 
tgccatctct aggaggtaac acgctagctc 
tcctgacggg atggggcgag gcgtgcgtca 
gaccggtctg tgactggtca cagaccggat 
gcgtgacacg ctcagccaaa ccgcaataaa 
taacccatac acgcggagca aaaacccacg 
gaggcgggtg cggtccgacc ccctcggggg 
gggcccgacg tcccccgagg gtgccaggcc 
tagtcatgat actcctgatc ccatgtcacc 
atcgccctct ttaagggaag cggtcgggcg 
gggaccggtc tccacaattg ggcagaaacc 
ggtaactcta ctatcaataa tgagcggtct 
cacgaatctg gacatggcga ttcgtttcgt 
cgagcgtaat taacgcgcgc acgatatgat 
ctcatgcgcc gcaagcgggc gaatgggatt 
gcgaaatagt gggcgcgaga agcgaggagc 
actgaggaaa ggatctgttt ccttcctttc 
gccctaactc cltctttcct gtgctctact 
cttcctccgg cgccatggca cggggctccg 
cccgcatcgt gagcgagagg caggclgggc 
ccggccggga gatagtcacg ctgggtgagg 
ccgtcttctl tctcccctit gcaatggcag 



ggggggatct gtgtatgaag gagctacaaa 18180 
tccgaatctg gaaaagtcca gtccagtccc 18240 
tatcttaagg ggataccaca tgcaccatct 18300 
tttcalaaat ggacggagat ttgtatagtt 18360 
ccatacctac ctccacttcc gccgaaagca 18420 
acgacatgac cagtgtcaga ctggtcacaa 18480 
ttagcaatct acatgttggc ccttcttcac 18540 
agcctggcat atatctaacc aggactaacg 18600 
cagctgggga cgagcgccta gaagccctcg 18660 
gatcgcctgt cgccacctaa cccgcgatct 18720 
aaacgagtgc actgcacttc gttacatgcc 18780 
tgtggttagg tgagccccgc tgtgctcacc 18840 
aggggtcggg gcgcctcggc cctcggggcc 18900 
gactaagagg agggcgaaca catcaccctc 18960 
acgtgggcga ttgtgtctgc ctcaaacctc 19020 
gacagtagcc cccggcgtta tgccagggcg 19080 
tgacgccact cctaaggcct ggtgacaggt 19140 
caacggtcac aaatcacgca catcggcaat 19200 
cttcaagact gccacattac tcgagtagca 19260 
ctggagatat ggtaacgtcg cttlggtcgg 19320 
ctatctcgac tgccacaacc gcatatccac 19380 
agtggaagcg tgggcgcgag aaacgagggg 19440 
cgggcacagc gttggcaaga gtataaaggc 19500 
gccatcattt cccttgtctt cgccgcttgc 19560 
ttcgccacac gcgctcgctc tcaatcttct 19620 
ctctgctcga tggtagcgtg ctgccgcctt 19680 
tgccgcgccg cttcatgccg gaatctgcca 19740 
gacgcccggc gccagactac ccggggcggt 19800 
ggctggttcc gccattttct tctttcttca 19860 
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tggatgttct gaagttctac gatctccaga 
cattggccat cttcgcgcat ctgtgcgaga 
tgttccggtg gttcttcacc gtgcagtcgg 
acttccagcc atgggggccg gtgctgaatc 
gggacgactg gaagagcgac tggttctaca 
ttccgagcca gcccccggcg caggcctcca 
gctatgacgc cgtcctcgac cgcctggcgg 
tggtgtacgg cgactacctc cgtcglcgga 
cctgggagta caccgggtcc gaagactaca 
gggctcctga ggatttcaag atagtggtcc 
cgtccctcat tccccaagga atcctccctc 
tgaccattat gacggcggtc ggggcctcag 
caggcgggag ccgtaggggg gatcaatcta 
gcgacggagg cccgaggagc agccgccctg 
gaacacctcc cccatctcct ccccgagggg 
cggagggggc cgcgccgaca tcgcagcccg 
agatggggga gacagaacca tctcggggaa 
accgaccccc tcgcaggttc gtctctcacc 
agttttcact cacccatctt gttcgtcttc 
glcgcgtccc tcccgccatt ccaagtccgg 
agaggcccgg aggcgggaat ctgaccggcg 
ggaggccgcc caggaggccg cccgggctcg 
cgcccgggcc cgccaggccg aggaagccgc 
cgaggaagcc gctcgggaga aagccggatt 
ggcagctcgc gatgaggtcg cgggcgcgtc 
ggcgacaact tccggggcag ctggcgacga 
ctcaggcgac gcccaggacc aaccaggtct 
cggcggcccg agccgcgtgg catcctctcc 
cccgctgagc gcagagcccc ttctgcaggc 



tggcgcacct cacccccaac gcggtgatga l 9920 
tgttcattgg ggtgcgccca tctcttcggc 19980 
tgtcgccgcc atcggtagtt ggtggctgct 20040 
gctacatccc ctgcgccctc cgcaagaagt 20100 
cccccctcgc cgacgaagcg cgcctctgac 20160 
gctggcgggc gccggtagat ctgggggatg 20220 
gcctacgatc ccaggggctc acaggggcca 20280 
ttgcgccgct ccagcggcgc gctcggggcg 20340 
tgaggaccca ccagggagtc agatgggact 20400 
aacgggtgct gaatctcaac tccatggagg 20460 
tctgcagcga tccagaccgc gcctccatcc 20520 
aggagtgagc tccaaagggc cacgacggcg 20580 
ccccgggagg gggtcgtgct tctgggtctc 20640 
ccgacgcccg ggggaagagg aagcagggag 20700 
gcggggcggt gcgtgccaac agcaggcgcc 20760 
agggggagcg caagaagaag cggctccgca 20820 
accttatttc ccctccaaag tggtcgttta 20880 
catcgtggct gtattcattc tctcaacgcg 20940 
tggtcttttc ttctgtttca gcgagatccc 21000 
ccagtctgag gccgaggatc cggcggccgc 21060 
agaggccgcg gatcgcctac gggaagccga 21120 
ccagggcgag gaaaccgctc gggaggaggc 21180 
tcgggaggag gccgcccgag cccaccaggc 21240 
tcgccaggac gaggcaatgg cgacttccga 21300 
gcttgagccc gc t tec tegg gcgacgctca 21360 
ggctgeggge gcgtcgcttg ggcccactcc 21420 
gagggacatc cccgagtccg gcacttccat 21480 
aaggeggetc ttccccacgc cttctatcgc 21540 
cttggccgcc gcaaacatcg cggtgttgga 21600 
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cgggcttagt gcccaggtgg aggccctgca 
ggcgcgtgtc gaggaggggc ggcgctcagt 
acaccgccgg catgtctcgg agcttgaagc 
ggaagtggag gaggagcggg. gggctgccct 
ggacaccctc cgccttcaat acgggagctg 
cgcccagggg gtgcttgacg ttgccgctgc 
ggcgtcccga cggcgcgaag agacccttga 
ctgcgtcgtg gagagggatc tggcggaccg 
. actggcggcg cacgagtccg cctgtgccga 
cgcgctcacc gagcgggagc gagctctcga 
ggacagcclg tccctccgcg aggcagcgca 
tgtccgcgcc gagaggaccg cactaaacca 
ggagctggac gcgagggcgc gcagcggcgg 
ccgcctcgct gctgccgaac ataccatcgc 
cggggaggtc gaggccctcc gcttggcagg 
cgtctcccgc ctagatcgcg ccggtcggca 
gtacgc'cgcc aaccatggag gcctcgccca 
acggctcccc gaggagctcg agaagacaat 
agcggtggag ctcgtaclgg cgagttacca 
ggcgctggat gagttccctc ctgggaccga 
ccgaccatat cgtccacagc ttcgagggct 
ccgacgagga ggacaatgcc ggtggtgcag 
gcgtatcgga ttgatccccc aagcccccgc 
tctaaggcct tcgggcctct tttttgtata 
aaatttttgt gtcaatttca tcttgctgtg 
tcctlttgcg tcttagcttg attaagggct 
gggtcggggc tagtgcctgg ggagatccac 
tgaccgaggg ttatgggtga cccgattgtg 
cacgccccgg ggcacggctc ggttctgggc 



agcagagtgg gcggagctcg acgccgcgtg 21660 
ggaggccatg gtggaggtgg gccgcaaggc 21720 
ccgtaagaag gtgttggcgg aaatcgccaa 21780 
cattgccacc agcgtgatga acgaggcgca 21840 
ggaggcggag ctagggaaaa agctcgacgc 21900 
ccgagaacag cgggcggggg agaccgaagc 21960 
ggcgcgcgcc atggcgctgg aagagcgcgc 22020 
cgaggccgcc gtcactatcc gggaggcaac 22080 
agaggagtcc gcactccgcc lecacgagga 22140 
ggaggccgag gccgcggcgc aacggctggc 22200 
ggaggagcag gcgcgccgca ctctggaatg 22260 
gcgggccgct gacctcgagg cgcgggagaa 22320 
ggcggctgcg ggcgaaaacg acltagccgc 22380 
cgatctgcag ggcacgctaa actcgtccgc 22440 
cgaggtaggg cccggcatgc Ittgggacgc 22500 
ggtgggcctc tggagagggc ggaccgtaaa 22560 
gcgcctctcg aagatggccg gggctctcca 22620 
taagtcatcc tcgagggacc Icgcccaagg 22680 
ggccagggac cccaatttct ctccatggat 22740 
ggacagcgcg cgcgcaggtc cgggatgccg 22800 
cagcccctcg gctcgcgttc gcccccaact 22860 
acgacagtga cgatgaggcc ggcgacccgg 22920 
cattcttcag ttttttcttc ttttccttct 22980 
gatcaactta atctgtaatc aaaaatgaag 23040 
tgtatgagat gaggatgatc tgtgacgtgg 23100 
cgtgcccagg tcccagtcct caaaaggcgt 23160 
atgtcgagac tggccaggcc gggaacgtgg 23220 
ggtltttgcc gattcccccc cggagttcac 23280 
cccgtttggc gattttagcc gacccgagcc 23340 
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cccgagggca ggattgagca cgagtgacct 
aaaaacacag atacagcctt taggaaattg 
agaaataaga atgtgcatgl gtggcagccc 
ggggttggcc cgagcccgaa acctgacacc 
aaggtgttcg atgttccacg ggttaggcag 
ggagcccggc cgggggacgc cgaccactcg 
gctcaatcca gcacgcgttt ggacgcggcg 
ggcccggacg tgacgctgat gglagcgccg 
ggccgcgcgt cgccttcgct cttccaagta 
atcagcctcg cagtacatgg tggcccgagg 
cgcttccgcg ccgtagacga ggaagaaagg 
tcggtttgcc cagagcaccg ctggcaactc 
tatgttgaag gtcttggttt taaggccttt 
gccaitgctt ctggggtggg caggtgaggc 
gtagtcgccg aagagttcac tagtgaattg 
cactccaaac cgggccgtga tgcccttaat 
gacgaccgga taagcctcgg gccacttagt 
aaacccgccc ggggcccgcc taaacggtcc 
ccacgaaagt ggtatggtct gcagggcctg 
ttgacacgct ctacatcgcc ggaccaggtc 
gaaaccctgg cgaaaagctt taccaaccaa 
gccttcatgg aiatcggcaa gaagcacaac 
gattccatta gccgcgcgcc gatagagggl 
gcgatggacg cgttcactcc cttcgcggtc 
ttggatctcg gcaatccaag caatcaatct 
cgaggcctgg acttcaacgg gcctcggggg 
gtcgggtcgc gccgacggct gggcaagcct 
tcgcgtggac gcgagccgtg agagttcgtc 
ccgaagctca atcccgtcaa aatggcgctc 



atttcaagtc aagattcttc aaaaggaaaa 23400 
aaactgcttt tattgaaata ctgaaataag 23460 
ccggccaacc ctgcacgccc gagggggtgc 23520 
cgaccccccc cctcaggggt agaagcgacg 23580 
ctcaatgccg tcgcccgtgg ccagccgtat 23640 
atacggaccc tcccacattg gtgagagctt 23700 
taggacgagg tcgtcgacgc agagtgatcg 23760 
caggctctgc tggtagcgcg cggctctgag 23820 
gtcgaggtca tele tgcgaa gctgatcttg 23880 
agacctcagg gtgagctegg atgggagaac 23940 
cgtttccccg gtlgctcggc ttggtgtagt 24000 
ctcgatccat gaatcgccgt gcttcttgag 24060 
gaggatttct gaattggcgc gctccacttg 24120 
gaagcagagc ttgatgecca tgtcttcgea 24180 
ggtgccatta teegtaataa tacggttagg 24240 
gaatttaagt gcggagtgct tatcgatctt 24300 
gaacttgtcg ategegacat acagataclc 24360 
caggatatcg agcccccaga cagcaaatgg 24420 
ggccggctga tggatttget tggcgtggaa 24480 
gaccgcatca ttgagagctg tcggccaala 24540 
ggtgcgcgag gcggagtggg ctccgcattc 24600 
gccttgttcc egaggaatge acttcaggag 24660 
cccttctacc ageaegtage gtttggagat 24720 
ctegggtaaa gtcttatctg tgaggtatgc 24780 
aagggagctg ggagcgctcc cctcgggtcc 24840 
ccggtcaggc gcgtccgtct cccctaaggg 24900 
ttcttcaaag gcgcccggtg gggtctgggc 24960 
ggcaatcatg ttatcccgtc tgggcacatg 25020 
catacgccgt act tggegea cgtaggcgtc 25080 
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catctgcggg tcagagcacc ggtactcctt 
gcctaacacc aggaggcggc ggatccccag 
tccctcgtac tctgccatat tgttggtcgc 
gacgtctccg ctcggagagg tcaacgtgac 
gccgtcgaac tgcattaccc agtgggcggt 
ctcggggatt gagacgggct cgggagccgg 
ctggctcttg atagcgtggc gtggttcaaa 
ccatttcacc acccgtcctg taccgtctcg 
cgtaaccaca gtgacccgat gcgcctggaa 
aatagcgtaa agcatcttct gggcctgagg 
aclaacaaag tagacgggcc gctgcacctt 
cgcatccctg gggcactctt cgtccaagca 
gatgacctcg gggtcggagg ataacagggg 
ctgggggtca ggggctcctg gcgtcgtcgg 
caggggcclt aggcctccgt tcggctcggg 
agtcagcaca gggttagcct cggggtcaaa 
ctcagggcct tcctgggggt cgggggctcc 
aactccggtc gtcgggggcc tcgggccacc 
tctcccgggc caagtcggca cagggtgggg 
ctccacaacc aatgccgcac taactacttg 
ctcgtctggc tccggggcga ccagaactgg 
gagggcatlt tcagcttcct tcgtccaggt 
taagggtaac gcctlctctc ccagcctcga 
ggtgacgtat tgcacatccc taagtttgct 
cttctcgggg ttggcctcaa tgccccgggc 
aggtacaccg aacacacact tatcggggtt 
aaaagtttcc gctagatcta tgagtaacgt 
atcgacataa gcttcaatat tacgtcctaa 
t tgaaaagta ggacctgcat tctttaaccc 



acagacttgg ttaacgacca gctgggagtc 25140 
tccagctgcc actctgagtc cggcaaggag 25200 
tcgaaagtcg aggcggacca agtatctgag 25260 
ccccgcaccg gcgccctgaa gagacaggga 25320 
gtgaggcagc tgcgaggggt ccgtgctggc 25380 
ggtccaclct gccacaaaal cggcgagagc 25440 
gtgcaaatcg aactcagaaa gttcgattgc 25500 
attatgcaag atttgaccga gggggtaaga 25560 
ataatggcgc agtttcctcg aggccatcag 25620 
gtatcgggtt ttggcgtccc ggagggcctc 25680 
tcggtggggc cgatcctctt cgctaggggc 25740 
gcctcgcggg gcgcacttgt cttctgtgct 25800 
cggccttccc acagtggctt tggggccgtc 25860 
acaagcgggc aaagggccaa ctccggtcgt 25920 
ggcctcttct ccctgctctt tcccgggtcg 25980 
gggcgatagg tgcggccttc ccacagtggc 26040 
tagcaccgtc tgacaagcgg gcagagggcc 26100 
gtlcggclcg ggggcctctc ctccctgctc 26160 
aagcgcgaaa tgagaattgt cctcatcgcg 26220 
cggggtcgcc gctaagtaga gtagcaaggg 26280 
gggagagctt agatacgcct Icaactgggt 26340 
aaacggtccg gagcgtttga gaagctlaaa 26400 
tatgaaccga cttagggcgg ccatgcaacc 26460 
gggggggcgc atccgctcta tagcccgtat 26520 
agagaccaag aacccgagaa gcttgcccgc 26580 
taattttatg cgggcggagc ggagactctc 26640 
ttcclggttg cgcgtcttta caaccaagtc 26700 
ttggclaccc aaagaaattc gagtagtacg 26760 
gaagggcatt gtcgtataac aataggttcc 26820 
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tatgggggta atgaacgcag ttttttcctc 
accagagtat gcatctagaa aacacaaaag 
atctatgcga ggcagggggt aaggatcctt 
gatgcacatc cgaagcttgc cgttcgcctt 
ggcggggttg acgctgccat catatttttc 
acggacattc cgaagactcg ccacaaaggc 
ggagggctga ttcccgcgtc cgtgttgagg 
tgcgtgggca gcacttcggt cattacgccg 
cccacgcaga tctttctggg tcgaaggagt 
agcggclgcc gctcgtcgtg ccctccgtct 
agaggccttg gtggcggagg accgcccacc 
taatttggcc acgtcgtcca gccatcgttg 
cgggtgacgt aagagcgcgc ccgcagcttg 
gtagacgagg aggcgacgct ccccatctcg 
agtcgcggat ctttcgaccc tctcgagcgc 
gagcggtgga gtacgagctc gacggcgtgg 
cggagagagg tcgtgcgccl ttgcllgctc 
gcacacggaa gagtacgaga gctcagaaaa 
agatgacgga gcgtggggct cctcaccggg 
gccggggact cagggtgaaa ttctaagctc 
agagcataag acacgggcga tgtatacagg 
tcctgtgttt tggggggatc tgtgtatgaa 
ttgttctggg ttccgaatct ggaaaagtcc 
tcctcctttt atatcttaag gggataccac 
ttaccctacc ttttcataaa lggacggaga 
gataggacgg cccataccta cctccacttc 
gctgtctgct gacgacatga ccagtgtcag 
cacgcglcag tttagcaatc tacatgttgg 
ggttaggatg aagcctggca tatatctaac 



atcctcccta gccatgcgaa tctgatggta 26880 
gtcgcacccc gcagtggagt cgacaatctg 26940 
aggacatgcc ttgttaaggt cggtgtagtc 27000 
gggaacgacc accgggttcg ctagccactc 27060 
ggcgatggtg ggccggaacc ttgggggcca 27120 
tctacagccg acaccaccaa ccgggggcac 27180 
tgacactctg gacgaggaag cgccctccgt 27240 
gcgctcgatg ctggtgcggg cgtccggccc 27300 
cgacgaagga gtggcggccg aatggcgaac 27360 
tgacgacgcg gagccggtgg tagcagcacc 27420 
agcatctagg cgctgccgta ccgtcatgac 27480 
ggctggagac tccgggtcag ggacgacagg 27540 
gagcgcgccc tggggcgigc tgccgtcgcc 27600 
ccgttcttct ccatcgcccg cgatcggtga 27660 
ctccccccgc ttaggacttt ggcgtggagg 27720 
gttcggctcc ccgtcgtcgc cactcacact 27780 
ggccatcagg ctgaacagga aaagcttggc 27840 
acacacactg agtcccctac ctggcgcgcc 27900 
agaccgcgca ggcccccctt tgccggttcg 27960 
tctgtatgtg gaaggttcgc gaccgtcgaa 28020 
ttcgggccgc tgagaagcgt aataccctac 28080 
ggagctacaa agtatgagcc agcctctccc 28140 
agtccagtcc ccccctctaa gtgggcaagg 28200 
atgcaccatc tccctccttt ctgtggggac 28260 
tttgtatagt tgccgtccga atgaccttct 28320 
cgccgaaagc aggtgcgacg tgggattatg 28380 
actggtcaca aattgctcat tcctgtccac 28440 
cccttcttca cacaacatct tgcctgtaat 28500 
caggactaac gtgccatctc taggaggtaa 28560 
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cacgctagct ccagclgggg acgagcgcc! 
ggcgtgcgtc agatcgcctg tcgccaccta 
acagaccgga taaacgagtg cactgcactt 
accgcaataa atgtggttag gtgagccccg 
aaaaacccac gaggggtcgg ggcgcctcgg 
cccctcgggg ggactaagag gagggcgaac 
ggtgccaggc cacgtgggcg attgtgtctg 
cccatgtcat cgacaaggcc atccgaalgl 
ccacaatgca ccaaggtgca tgaccacaca 
agggtgaagc ctagcaccaa acgaccgcca 
ggcagcaaca catagatgag acttcgaaaa 
gatgctgcca tcgtccatct aaaaagatgt 
gggagagggt aacccttgac agcgccccaa 
tgccggtccg gtgaaccacc ggactaggct 
gatcaccgtc caccactcag aaccaccaca 
caccgcaccg acgccccttc gtcggccgac 
ccaggacccc \ccgt tccac cacccgccgg 
ccgggactgg gtgacattgc ttcggcagcc 
caatccaacc tagaaactcc ccgcaaaaga 
ccgaccggca acgaggaaga caacccatcg 
cctgcgccaa tgccggatac gctgtcgctc 
ttgcctggtc tccgcgcccc tcctggctgc 
ccacgacggc cgcccggcta ccgaggcctg 
cagcgagcca cggccgtcgc gctgtlgccg 
ggccgagcat gccactgagc cgccgccgct 
cacacgaccg cacgccgcca cgctccgcct 
gcgcacctcg cccgcccgct gccgagccgc 
gtccctagcc gcctcgtgcc gccgccacgc 
ccttgggggc gccggatccg ccgcctcccc 



agaagcccic gtcctgacgg gatggggcga 28620 
acccgcgatc tgaccggtct gtgactggtc 28680 
cgttacatgc ggcgtgacac gctcagccaa 28740 
ctgtgctcac ctaacccata cacgcggagc 28800 
ccctcggggc cgaggcgggt gcggtccgac 28860 
acalcaccct cgggcccgac gtcccccgag 28920 
cctcaaacct ctagtcatga tactcclgat 28980 
attaaggagt aaaagttaca agaaaaaaca 29040 
ccatacacta cccccaagca caaaccactg 29100 
ctaagtgtga ccaaacgccg ctaggcctac 29160 
cgatgccacc aaggtggtca cgacatgtag 29220 
ggttttcacc cagagaaact catcaagaag 29280 
ggaggttacg acgcccgaag gcgtagccgc 29340 
tccgcctagg accctatagc cttgatcgca 29400 
cagacaaaag gtagcacgta gcttccaccg 29460 
tccatcgaac caccatccct gagagctggc 29520 
ccgccttgcc agttttggcc aaaggagaac 29580 
tgagcttccc ccgctggcga gctgctgtct 29640 
aggggatgag ctctaggaag ggcgagggtg 29700 
actccagctc cctttgcact accatctggg 29760 
cggctccggc gccacccacc tgcaccccct 29820 
gtcgcgccgc ccagctggcc gctaagggca 29880 
gccgcgccat gggacagctc gcgctggcac 29940 
gcgccagcga gcacaaccgc cagctccaag 30000 
gccgcccggg ccggctgcac gtcaccggcg 30060 
ccgcgcccga ggcagcccca tgccattgcc 30120 
caccgcgcac cttgctgagc cgccaccgcc 30180 
cagatccagg cgcgggatgg ccggatccgg 30240 
acaccgccac ggcgtcacca cctccgaccg 30300 
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cagtgagggc ttcgtcgttt gccccatcct catcgcgtcg aggaggaaga cgccaagaaa 30360 
aaagggcctc gccgctgcct tccttgctcg ctgccggctt cgccgccggc gagctccggc 30420 
ggcggcgagg tgggggagaa gaagtgggga gtgggcagct agggtttttt cgccccccaa 30480 
gccgcccgtg cgagagcgac ggtggggggg gggggacttt ccaacctctt ccagtgttct 30540 
agttctccac gttatgtaac tcaatttgtt taaccataga aagtaagaaa cctaccagcg 30600 
tgt taagctc tctttcattc cctttcttct tcctggtttt gcttccatca calgtcaagt 30660 
gaagggltct taactaccat tactcctaca calctaattt ttitctcaga tctttcgcag 30720 
gtatatattg atgctacatt ttatgatctt aagataatct ccttcacatt accctctgct 30780 
gaaactttag cttgaaccgt catcttcacc acaatttgag cccaatttgc acagagcaca 30840 
acgagcaata gcttgccctt acgttcatta tttagcatga actactacta actacccaag 30900 
aatcaataca ccggtttaat aacgccattt tatcacgtta atatatgltt cattcaacac 30960 
accggtlttg gcacagltgc aaacttgcaa taaattcttt cctacttctc catcccataa 31020 
tataacaaat tggtatgtct cgtctggtac taagttgcta tattatgaga tggagggagc 31080 
acttcttttc ttccaaaata taagaatata gtattggatt agatattatc tagattcacg 31140 
aattcgatta ggttgtctag atttatagtt gtatgtaatg tataattcgg taataggtta 31200 
ttacctctcg ggatggaggg agtagttttg actttttttt ttcttataaa tcgctttgat 31260 
ttttatatta gtcaaatttt atcgagttta actaagttta tagaaaaaaa ttagcaacat 31320 
ttaagcacca cactagtttc attaaattta gcatggaata tattttgata atatatttgt 31380 
tctgtgttaa aaatgctgct atatttttct ataaacgtag Icaaatltaa ataagttaga 31440 
ctaaaaaaaa tcaaaacgac ttataatatg aaatggagga agtagtagac tataacaaat 31500 
ttaaaccgtg ctttgatttt agagcatcac taatatgtta gcaataatct atccctaaaa 31560 
tttatttttt ttcctaaact gaaaatagga agtggaaata ctcctccatc taagagagag 31620 
cctaaattca ataaaaaact aaaaaactaa aggtggatcc ctctattaaa ctaccgcaaa 31680 
aaatttatgt tttttttctc ttccacgcgc gcagaacaga tatctcgatc aagttagcat 31740 
gtaaaatttt taaagagata ccttatacga ctccttccgt atttccaaaa gcaaacggat 31800 
ttaaaatctg actcaaataa agatctatat atccaattta catgacacat gtttcgccga 31860 
atttttatat taataataat taatattttt aaaattaaat tattagcaat ttgtttggag 31920 
gatttatcaa aacaggatgg acgttgttta taacagcgtc tagacctaga cgcgcttgca 31980 
aactgcggcc acccttttat cacacaaatt tttgacaatt tgacactttc caaaaattaa 32040 
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ttttataaat taaccgtgac caaaacttat ttaaaaataa tctttttgtt gagcgcaaaa 32100 
tcgtatactt cagcgccaaa tagcacggcg ccgacctccc ccttcccctc ccctctatcc 32160 
tccactgctg ccgcccacct ctccgtatca gctgcgtcgc gttggtttcc gccggcgctg 32220 
ctgctgctgc accagtccgc tagggcgggc gggcatggcg cgccgcgccg cttcccgcgt 32280 
ccgcgccggc gctgttggcg cccttcgctc ggagggctcg acccaagggc gagggggccg 32340 
cacggggggc agtggcgccg aggacgcacg ccacgtgttc gacgaattgc tccggcgtgg 32400 
caggggcgcc tcgatctacg gcttgaactg cgccctcgcc gacgtcgcgc gtcacagccc 32460 
cgcggccgcc gtgtcccgct acaaccgcai ggcccgagcc ggcgccgacg aggtaactcc 32520 
caacttgtgc acctacggca ttctcatcgg ttcctgctgc tgcgcgggcc gcttggacct 32580 
cggtttcgcg gccttgggca atgtcattaa gaagggattt agagtggacg ccatcgcctt 32640 
cactcctctg ctcaagggcc tctgtgctga caagaggacg agcgacgcaa tggacatagt 32700 
gctccgcaga atgacccagc ttggctgcat accaaatgtc ttctcctaca atattcttct 32760 
caaggggctg tgtgatgaga acagaagcca agaagctctc gagctgctcc aaatgatgcc 32820 
tgatgatgga ggtgactgcc cacctgatgt ggtgtcgtat accactgtca tcaatggctt 32880 
cttcaaggag ggggatctgg acaaagctta cggtacatac catgaaatgc tggaccgggg 32940 
gattttacca aatgttgtta cctacagctc tattattgct gcgttatgca aggctcaagc 33000 
tatggacaaa gccatggagg tacttaccag catggttaag aatggtgtca tgcctaattg 33060 
caggacgtal aatagtatcg tgcatgggla ttgctcttca gggcagccga aagaggctat 33120 
tggatttctc aaaaagatgc acagtgatgg tgtcgaacca gatgttgtta cttataactc 33180 
gctcatggat tatctttgca agaacggaag atgcacggaa gctagaaaga igttcgattc 33240 
tatgaccaag aggggcctaa agcctgaaat tactacctat ggtaccctgc ttcaggggta 33300 
tgctaccaaa ggagcccttg ttgagatgca tggtctcttg gatttgatgg tacgaaacgg 33360 
tatccaccct aatcattatg ttttcagcat tctaatatgt gcatacgcta aacaagggaa 33420 
agtagatcag gcaatgcttg tgttcagcaa aatgaggcag caaggatlga atccggatac 33480 
agtgacctat ggaacagtta taggcatact ttgcaagtca ggcagagtag aagatgctat 33540 
gcgttatttt gagcagatga tcgatgaaag actaagccct ggcaacattg tttalaactc 33600 
cctaatlcat agtctctgta tctttgacaa atgggacaag gctaaagagt taattcttga 33660 
aatgttggat cgaggcatct gtctggacac tattttcttt aaltcaataa ttgacagtca 33720 
ttgcaaagaa gggagggtta tagaatctga aaaactcttt gacctgatgg tacgtattgg 33780 
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tgtgaagccc gatatcatta cgtacagtac tctcatcgat ggatattgct tggcaggtaa 33840 
gatggatgaa gcaacgaagl tacttgccag catggtctca gttggaalga aacctgattg 33900 
tgttacatat aatactttga ttaatggcta ctgtaaaatt agcaggatgg aagatgcgtt 33960 
agttcttttt agggagatgg agagcagtgg tgttagtcct gatattatta cgtataatat 34020 
aattctgcaa ggtttatttc aaaccagaag aactgctgct gcaaaagaac tctatgtcgg 34080 
gattaccgaa agtggaacgc agcttgaact tagcacatac aacataatcc ttcatgggct 34140 
ttgcaaaaac aatctcactg acgaggcact tcgaatgttt cagaacctat gtttgacgga 34200 
tttacagctg gagactagga cttttaacat tatgattggt gcattgctta aagttggcag 34260 
aaatgatgaa gccaaggatt tgtttgcagc tctctcggct aacggtttag tgccagatgt 34320 
taggacctac agtttaatgg cagaaaatct tatagagcag gggttgctag aagaattgga 34380 
tgatctattt ctttcaatgg aggagaatgg ctglactgcc aactcccgca tgctaaattc 34440 
cattgttagg aaactgttac agaggggtga tataaccagg gctggcactt acctgttcat 34500 
gattgatgag aagcacttct ccctcgaagc atccactgct tccttgtttt tagatctttt 34560 
gtctggggga aaatatcaag aatatcatag gtttctccct gaaaaatata agtcctttat 34620 
agaatctttg agctgctgaa gccttttgca gctttgaaat tctgtgttgg agttcttttc 34680 
tcctacagtt gtattagagg agggatcttc tctttatgtg taaatagcga ggtatgtatg 34740 
tcacctctcc gaattatttt tactctggtt cctagacggt aaacaagcaa ttatgttctg 34800 
ccittgatgc cagaaaaaac acaaaagttt gtcgttatct ctactaacgg atcataaagg 34860 
aattlgtaac tggagtttca aacttaattt gtctaggcag tagttttggc attagatcca 34920 
acattgtgta ggattcattt gtgtgtatca atctataggg tttcattaaa tttcgttaat 34980 
glgtactgtt taggtgttga atagtttgac ttgtttttta actgaacaaa agatactgaa 35040 
atcgttccat tcaacaaaca catgttccgt taalgaaatt attgtacgtt accttttgtt 35100 
ttcttactca caagtgtcct cttltcttat atcctataga ttggtacaac aaattattga 35160 
ttcaattttg gttttgaaca ttgatgatcc tccctgcact attggtgcag ctgctcttct 35220 
attcattttg tgaagtgatg tgagtacctc tcaatcccat ccttatgctt ctgtgcatgc 35280 
t teat t ccaa ttttttaege atatcgattg ttttctttta tataacagtc cataaagata 35340 
atcacatcat gacaaagtta tttatttcta caglatagtt atataagtat tcaccagttl 35400 
tccatgaata ttttggcatg tgattacaaa gaagattatt tgagaaaatc catgclttta 35460 
tltcatcttt ttgtttgaag ttgaacttta atttatggtg laaatttcag ttattattgc 35520 
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tagcagctcg tactctttaa tggtataact 
tcttgttgtt caggttcaag aaaatcattt 
tcttaaatta ttaaatcctc caglgaatct 
caaacttctt ggaatcagta aagttcaaat 
catttcagag gaaatccttt caaaagttga 
gtcgctcgac aaagatgaca agaaaacaac 
tttgttgaca atggaaaatg catgcttgtc 
aaatttcttg gtcatgattt ccctttatga 
gcagttgtca acttgtcata agttacatag 
cttagccctt ttcttcacct gcttttgatt 
ttggagtagt ggactgcact agaagcacct 
cttatcaaat atttgattgt tacagagact 
tttaaagaga cattaaggga gatgggaggt 
tttcattcaa cattggaggt gagatctcgc 
caactctaat ggattgtgca ggcttgttcc 
agccacatga aagtaatgct tgtccagata 
tggaaaatgc ttgaggtagt aactattttc 
aattatgttg ttttatgltg caaaaatagt 
gttagtatct tacagtlcct ctgatgatta 
taataacact tgacatatct acaccaagtg 
gctatact tg ctgt tcttgc atgtgtaagc 
atgcttggtg ttcctatctg tgtacttttt 
gctgataaac tgaataaata aattaacaat 
aatctcataa aggatacatc cacttcagct 
agtgctgctc tcctcttgaa atgtttgaaa 
gataacaagg taatgctcct tatatgttct 
glactatctt ctctcctgat ttgttctgtg 
tctgcttaac aattttcttt tcttcctgaa 
cttcttgcag acccatttgc ttaatatgag 



tcacttgtgc ttattctcca atatctccct 35580 
gttggattca gaatctggtg tccattttct 35640 
tgttgattcc aaagcaccat cgataggttc 35700 
gcttaatgga tcaaataagg attctgactg 35760 
agagattctc Itaagctgtc aagtgatcaa 35820 
aaggccagaa ctgtgtccaa agtggcttgc 35880 
tgctgtttca gtagagggta agttttaatc 35940 
ccattatatt tatttatatg agccaaataa 36000 
cacctatttg caatattcat gggtggtttg 36060 
gatgacttcc atctgtgttg cagaattgaa 36120 
atggccattg tcatactagg aaggttttcc 36180 
tctgacacag tgtccagagt tggaggaaat 36240 
cttgatagta tttttgacgt tatggtggat 36300 
taacatcgca tattttacat ttcctttgtt 36360 
ttttcgccat titagcttta atgtgcttga 36420 
catagccaaa ggttgttata ttttggggca 36480 
atcaggacat ggaaaattgg ctgcaacaca 36540 
tttttaatac ttttttattc tgcatgtggt 36600 
tatcccccac gataataaca cttgaaacga 36660 
aacattattc atttggatgt tacttttcca 36720 
aagtttggag taaattgcgc attaatttaa 36780 
attccccaac taataatgca atcatattac 36840 
atacttctgg tggcaaacct tgtgtatcag 36900 
ttggaccgaa atgaaggaac atctttgcaa 36960 
atattggaaa atgccatatt tctaagcgat 37020 
gtttcagttt agtacccatt tccttcttct 37080 
caaaatgtgc aaacagtgcg actttgtatg 37140 
aaagcaatat gaactcttac attcattttg 37200 
iagaaaattg aacccgaaac gctccttgct 37260 
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ttcttttgtt ggtgtcatta tcaatactat tgagttatta tcaggtattt ttcttaataa 37320 
tacaatgtgt tcgctaacac aataaaatgt tttaaacatc cagtatgtta aagttgcagt 37380 
ctgacgccta ttttgitttg ctgcagctct ttcaatactt cagaattctt ctgttgtttc 37440 
cagctclaca tatccgaaat cgtctaaagt ctctcaacag agttactctg gtaataacaa 37500 
acaccaatlt tgtttgatca gttgatctcg ttggcttttc tatgcactgt ctcaatat.ag 37560 
tttggtcgcc attcaagtct cactacagal gttgaacttg gcctgacacc aaatatltat 37620 
aaaatgctac ctgatatttt taatatttca tgtttcctga cccagattat cttgttggtt 37680 
cctcgtataa gtttaattag tgacattctt gaagctttgt tatgcagcag atgtcatggg 37740 
gggaacllca tttaalgatg gaaagagcaa gaactcgaaa aaaaaaaact tttgtcgaac 37800 
cagacacglc attgttgctt atcttcaaaa tcagaagttt ctcatattac tatatcttct 37860 
ggtagtgatg ctggtctgtc acagaaggca ttcaattgtt ctccatttat atcaagcaat 37920 
ggggcatcaa gtggttcatt aggcgagagg cacagcaatg gtagtggttt gaagttgaat 37980 
ataaaaaagg atcgtggcaa tgcaaatcca attagaggct caactggatg gatttcaata 38040 
agagcgcaca gttctgatgg gaactccaga gaaatggcaa aaagactccg tctatcttaa 38100 
aatgtaatca ccgacagtgg tggtggtgal gacccttttg catttgaccg ccgcgtcggc 38160 
gtcgccacca cgtaatcgcc cacgtcgctg cccccgctgc cacgtcgtcg accgcgcacg 38220 
gtaalcacac gcatclcgag gccgccgcta gctgatatct tctcatccgg ttgatttgtg 38280 
attttggcgt ttttgcagtg gtgatggcgg ggggcgaccg tggccgaggc gtggaglgcc 38340 
atccgcatca gggtglatcg gccgcgctgc tccgccctgg tccgcaggct (Iggcggcga 38400 
gctggcggcg gagggagact gtggtgagat cggatttcgc cgctggtggt gtcgctacca 38460 
tgggggattc gccgcaggcg ctctcaggtt tgcagcctcc tccactctct tcccttttlt 38520 
attttttttt ctcgcaaaat gtgttgtgat gttcgtctcg ctgggcaggc ctcatagcca 38580 
ttaatgtagt ttgctggaac atttacaltt ggaacgttgt tggcaattgc ttgacaaaat 38640 
gtggaattgt ggaggggaga aaaatcattt gaacctgcag Igacaaaatt gccatctcta 38700 
attttaaaac tgaaggtgtg gaaatcaaac ataatcattg ccagcgcatc attcttgtta 38760 
accaccatga tatattgttg gttataacag ttagctccac accaaccttg aagglgtcaa 38820 
tagaatgttt agtataaatt gaggagaaca ggcagttgtt aagactttct aaagaacttg 38880 
tagcagctaa tactagctat tgtgcatltg tgtttcatgg aatttgagca gcaatggata 38940 
Ittcttacta agatglatga tgcaaaacaa aaaactalgt ctatacagtt tacatgtaat 39000 
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gtgcggatgc aaataaaatc atgtacatgg acaaactcat gggattcata ccgaattcca 39060 
gaattgcatt tcttalgtgg ttacttttgt tgttgatttg gttaccagac atcgalgtga 39120 
tttcaagggt cagaggggtt tgcttctacg cggtggctgc agttgcagca atctttttgt 39180 
ttgtcgccat ggttgtggtt catccacttg tgctcctatt tgaccgatac cggaggagag 39240 
ttcaggaaaa aaatttgaaa atacccattt tttgaaaaag atttacgttt atatacacta 39300 
gtatgaagaa tttgcgaaaa tataactaat ccgcagatcg gttatgcggg agcgcaacaa 39360 
aagtatggcg tggcggcgcg gagtggacgg ccgaggcgtt cgcgcggaat ggggctgcgg 39420 
gaccgagcca gtctcgcttg ccggtaacgc ggaaccggia cgctcccgca gcgccagtgt 39480 
gcggaaccgc ggcgccaaca tttttttacl gcatggcact gtgtttaata ctgtttgaca 39540 
ctgtttctgg tactgtttta cacagttccc gggtcagttc cgcacaatgg aggcgcggca 39600 
ccgaccatga acaatgtgtg aacagtgctg cacagggtta aaacagtgta taaactgcgc 39660 
tgcacagtgc tggagtcgct ggccactgcg gttccgcgtt ttggaaccgc gggaccgtcg 39720 
cgattccgcg ttttggagcl gccggaccat gacggttccg cgcaggatcg Icggtcccgt 39780 
at 1 1 tgaatc tgcggaaccg tcgctgtccc gcgtttccgt ttcgcgggat gcgtatattt 39840 
ttataaaacc tctccatgca tgtatataaa cataaattat tgaaaaaata aglatatttg 39900 
caaatttttt tcgagagctc agcactacat tgcaaagatt tgggcaactc tgacaatttc 39960 
catgttctac aagcttgacg tcgagggaat ggagaacctg ccaccgaata gtagccctgc 40020 
tatctatgtt gcgaaccatc agagtttttt ggatatctat acccttctaa ctctaggaag 40080 
gtgtttcaag tttataagca agacaagtat atttatgttc cgaattattt galgggcaat 40140 
gtatctctta ggagtaattc ctttgcggcg tatggacagc aggagccagc tggtatggct 40200 
gtagtctcat ccctgcttlc ttaagtagac atatatgcaa ttacagaatt tggtaaacaa 40260 
acaagatttt atgaatcata tatgattttg gggaaaacac caaactclct ttggtggclg 40320 
ccttgaacat agttctattc acacagttat agcaccttcl ttaaaatgaa gaactttgtt 40380 
gcatacacat atggccaaac cacataatga attttgttta tttctatctt tgaatgttag 40440 
caccttattt tcatgcatat catgctaatt tgcttgccca cgttgagtgg gaattttttt 40500 
ccatgtttta taatttatat atgltctaga cttctagtcc acaatttatc tacttcatgt 40560 
tcctgagcct ctagtatggc tggtagcaga ctaggtgctg agtgctgtcc atttttgcag 40620 
actgaagaga ggagaaatac aggactgtcc gttgttagtc agatttgtaa aaatagactc 40680 
tgatgtagtt tattttagcc cctattttat at t taacaat acaaatatat aacgtatcct 40740 
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aagaacttat cgtaat t tag gagaagttgc 
aatgtgtgct cgagtctgtc aatgcaatcc 
caggctagga tcgaacactg aatggtaaga 
ctgccacgcc gattaagcag tagaacaaag 
tcattgaaaa tcgaagtgaa aatgaaccaa 
tatatactcc acaaalttat tggtaagatt 
gttaaggata tcaaalttgg ggaagcacca 
ataacgttat ctactaggtt caaatcctag 
gatttattgt taattltagt ttctttaaga 
aataatgttt tgtatlggat tcacttgtgt 
ttctgcgttc tgggtatgaa tttgacgaga 
gcgtgtcaag cgatcttata tggtatgcgc 
aacgtgtgat attgatacga tgttcctttc 
aaaagaaggg gtattactaa aaaccaaaat 
agtgtgcacg agcaatagct tgcccttacg 
acgcaaaaat caattcaccg attattaaac 
gt t teat tea acacaccggt tttggcacat 
tegttacata gcataagaga ttttaggtga 
tagaatgtat caccgcctcc aegcegggag 
ggagtggtcg aggggaacca gtagggtgcc 
eggegagaca ggaggaagag ggggatatgg 
ggggggagcg gcgacgccgg tgaggaaggg 
ggggaggaaa aatgatttta gagttagggt 
atcaatcagg accgtccatc agateggaca 
ccactcctag gltgcccaca etattgggee 
tttaggtcct ttaagttgcc tctgaattgt 
cataggecat gtgtacgctc cgcacagaat 
ctcaaactcc taaaaccagt gcaaatcttt 
geatatttge tagtgacata taatatgaaa 



tegtttcatt aaattaaact gtgaagtaaa 40800 
tgtgttcttg tttgaagata tggtgtaggg 40860 
ctgcttctgc cttcatttgt gcacltggtg 40920 
taattttgtc gtgcacaaat gagttatatt 40980 
aagatagaag aaaaggggaa acttggtaat 41040 
tgatattaga cgctcgatta cttggcttaa 41100 
aaggaattat tgtgaaggag ttgtgggtgc 41160 
tgactatgaa tattaatgag taaggtaagg 41220 
ttgtgtccgg gtacaccatt cggtaagtgt 41280 
tacgtgcatg tgatttacct Ittcatttgt 41340 
ttccatggtc agctcaacat atcagttact 41400 
acaagegatt gtataeggat atgacagtat 41460 
ctttataaag gaacaaagac ttttttaaaa 41520 
gtcaaaaaca aaatatcagt gcacatggca 41580 
ttcattattt agcatgtact actactaact 41640 
tgttaacatc attttagcac gttaacatat 41700 
ttacaaactt gcaaagttgc aatactccct 41760 
atgtgacaca tctatccaaa ttcattatac 41820 
ggagagegee gccggtggag aaagggggag 41880 
ctccccgtcg ccgcctcccc gtggccgcgc 41940 
agcggcgccg ccggtgaggg cgcgcgcgcg 42000 
aaggggagtg gtggctttga gagagatagg 42060 
ttgggctgct gagtttttat atagateggg 42120 
actaeggett ctcccgcgtt gggccgggtg 42180 
acatgtaege tccgcgtgaa ataagttcac 42240 
tcccaggccg gccgcactat tgggccaccc 42300 
aatttcgett tagctccctt aatttgtccc 42360 
aatttttagt tcacccattg caactcacgg 42420 
cgaaggatgt agcagactat agaatttaaa 42480 
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ctgtgctttc attttagagc atcactaact gttatttaga tttttattta aataaatgct 42540 
gaaatgatgt ttttattatg aaaattagca ataaagctcc caaaatttca aaaaaaaatt 42600 
aaaagagatt tattaatcat ggttaattta attaaaaatt aaatctaacc atatcatatt 42660 
at t tcacggt ccgtgatgag gaaatggcag ctgctatcac ttacggtggg agagaagggg 42720 
cattgtttat ttttataact atctcttata actcccatga aactataaaa taaatataat 42780 
cattatcata acattagttt tttttccatt gcaacgcaag ggtaattttt cagtacaata 42840 
aaaaaaataa aagtgggcca ttctgaacgg aaatttctgg ttttttttcc caagagcgcc 42900 
gcacacaact gcgcaagaga tcgatcgcga tcaccctgct cgtcgccgat ctcctacacc 42960 
atccctgcca tctccttccc ctccactggc tgctgctgca cctgtcagct agggcgggca 43020 
tggcgcgccg cgccgcttcc cgcgctgctg gcgcccttcg ctcggagggc tcgatccaag 43080 
ggcgaggggg ccgcgcgggg ggcagtggcg gtggcgcgga ggacgcacgc cacgtgttcg 43140 
acgaattgct ccgtcgtggc ataccagatg tcttctccta caatattctt ctcaacgggc 43200 
tgtgtgatga gaacagaagc caagaagctc tcgagttact gcacataatg gctgatgatg 43260 
gaggtgactg cccacctgat gtggtgtcgt acagcaccgt catcaatggc ttcttcaagg 43320 
agggggatct ggacaaaatg cttgaccaga ggatttcgcc aaatgttgtg acctacaact 43380 
ctattattgc tgcgctatgc aaggctcaaa ctgtggacaa ggccatggag gtacttacca 43440 
ccatggttaa gagtggtgtc atgcctgatt gcatgacata taatagtatt gtgcatgggt 43500 
tttgctcttc agggcagccg aaagaggcta ttgtatttct caaaaagatg cgcagtgatg 43560 
gtgtcgaacc agatgttgtt acttataact cgctcatgga ttatctttgc aagaacggaa 43620 
gatgcacgga agcaagaaag atttttgatt ctatgaccaa gaggggccta aagcctgata 43680 
ttactaccta tggtaccctg cttcaggggt atgctaccaa aggagccctt gttgagatgc 43740 
atggtctctt ggatttgatg gtacgaaacg gtatccaccc taatcattat gttttcagca 43800 
ttctagtatg tgcatacgct aaacaagaga aagtagaaga ggcaatgctt gtattcagca 43860 
aaatgaggca gcaaggattg aatccgaatg cagtgaccta tggaacagtt atagatgtac 43920 
tttgcaagtc aggtagagta gaagatgcta tgctttattt tgagcagatg atcgatgaag 43980 
gactaagacc tgacagcatt gtttataact ccctaattca tagtctctgt atctttgaca 44040 
aatgggagaa ggctgaagag ttatttcttg aaatgttgga tcgaggcatc tgtcttagca 44100 
ctattttctt taaltcaata altgacagtc attgcaaaga agggagggtt atagaalctg 44160 
gaaaactctt tgacttgatg gtacgaattg gtgtgaagcc cgatatcatt acccttggca 44220 
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ggtaagatgg atgaagcaat gaagttactt tctggcatgg tctcagttgg gttgaaacct 44280 
aatactgtta cttatagcac tttgattaat ggctactgca aaattagtag gatggaagac 44340 
gcgttagttc tttttaagga gatggagagc agtggtgtta gtcctgatat tattacgtat 44400 
aacataattc tgcaaggttt atttcaaacc agaagaactg ctgctgcaaa agaactctat 44460 
gtcaggatta ccgaaagtgg aatgcagatl gaactltgtt agatttaatt ggataattaa 44520 
tccatttaaa tcaattaaat caaataaatt ccaaggctca ttatgctagg aattcatgtg 44580 
aattcattct tctatgggat atcaatggga tgaagagttt (gagaattaa tccatttgat 44640 
taaggaattg gtaacltata tcaattaatc ctaattgatg gatggttgat ggltgtgtag 44700 
tggaggatgg ttcatggcta gttgatgaca attagttgct ctattcctct tcctattcca 44760 
ttggtaactt acatcaatta ctcttaattg attgttggtt gatggttgtg tagtggagga 44820 
tggttcatgg ctagttgatg acaattagtt gctccattcc tcttcctatt ccatgactct 44880 
tactcttcat cttccattcc tcttataaaa tgagaatgga tttgatctcc cgcgagaaga 44940 
agaagacaca ctttcatcca ttttcaaaag ctgttgctgc lacggtaatc ccatcccgac 45000 
gagtgtgtgc acacgcgitg ggagagtagg cctccgaaac cacgcgttgc tgcgacgttt 45060 
gcacagacgg gcgggcgatc aggtttttgg ggagcgcaag gcgcgactac tcactgttcg 45120 
tcaacatcta cttcatcttc accaacatgt cgaacactgg agacaaggag aaggagactc 45180 
ccgtcaacac caacggaggc aatactgcct caaactccag cggaggacca ttcttggggt 45240 
ataaccttat tacatlattt caattagaag ttttactgtt aalgttcatc gcaatgtcaa 45300 
cattgtgtca ttatgtgatt gttgatgctt attcaacgtt aagcatgctc atgttgatta 45360 
cattcaccac tatcactgga tcaaatccta ttgtaaatat catgtttatt atcttgttat 45420 
tttggattaa aatatgccga attatgacca aatttccaac aaacttagca catacaacat 45480 
aatccttcat ggactltgca aaaacaaact cactgatgat gcacttcgaa tgtttcagaa 45540 
cctatgtttg atggatttga agcttgaggc taggactttc aacattatga ttgatgcatt 45600 
gcttaaagtt ggcagaaatg atgaagccaa ggaltlgttt gttgctttct cgtctaacgg 45660 
tttagtgccg aattaltgga cgtacagatt gatggctgaa aatattatag gacaggggtt 45720 
gctagaagaa ttggatcaac tctttctttc aatggaggac aatggctgta ctgtlgaclc 45780 
tggcatgcta aatttcattg ttagggaact gttgcagaga ggtgagataa ccagggctgg 45840 
cacttacctt tccatgattg atgagaagca cttttccctc gaagcatcca ctgcUcctt 45900 
gtttatagat ctt ttgtctg ggggaaaata tcaagaatat catatatttc tccclgaaaa 45960 
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atacaagtcc tttatagaat ctttgagctg 
gttggaattc ttttctccta cagtccgatt 
agcgaggtat gtatgtcacc tctccgaatt 
age lat t ate ttctggtgtt gatgecagaa 
aacggatcat aaaggggttt gtaactggag 
atatattgat cctacatctt atgatcttaa 
aactttagct tgaacegtea tctacaccac 
agcaatagct tgcccttacg ttcattattt 
caatacatcg gttattaaac tgtttgtaca 
tatgtttcat tcaacaccac accggtlttg 
actacttctc cgccccataa tataacaatc 
aeggatgaag tacttctttc cttccaaaat 
ttggattcac gaatttgatt aggctatcta 
gtaataggtt attacctctt tggatggagg 
tcatattata agttgttttg acttttttct 
tatagaaaaa aaattagcaa catttaagca 
atatattttt ataatatgtt tgttttttta 
tagecaaatt taaagaagtt (gattacgaa 
tgaggatgta gcagactata gcaaatttaa 
agagatagee taaatcttat cttaactaat 
ttaaattttc gtccgtaaat ccgattgaaa 
aagaacagaa aaaataataa aaagcacaca 
gatgccgccg aatccgctcg agcgccgccg 
ategcaegtg gtatgggagg gcgccgccgc 
aaggaaatgt gagggagggt taggtttttc 
atccaagctg tccatcagat cagacggctc 
ttgatgggcc gagggaaggc eggagggteg 
aggtaaagta gaatttattt gegggctgag 
gaaattggga attttattta aataaatgtt 



ctgaagcatt ttgcagcttt gaaattctgt 46020 
agaggaggga tcttctctgt atgtgtaaat 46080 
altttgactg tggtlcctgg actgtaaaca 46140 
aaaacacaaa agtttgtcgt tatctctact 46200 
tttcaaactt aaggtatcta ggcagtaggt 46260 
gatgalatcc tlctcattat cctctgctga 46320 
aatttgagee ccttagcaca gagcacaacg 46380 
ageatgeact actactaact acccaataat 46440 
gtttaataat gtcattttat caegttaaca 46500 
gcacagttgc aaacttgcaa taacattttt 46560 
tegttccata etatattget atattaeggg 46620 
ataagaatct agtcctagat tagatattat 46680 
gatttgtagt cgtatgtaat gtctaattcg 46740 
gagtagtttt tatttegtae tccctctgtt 46800 
tagtcaaatt ttattgagtt tgactaaatt 46860 
ccacattagt ttcattaaat gtagcatgga 46920 
tlaaaatgct actatatttt tctataaatg 46980 
aaaaaatcaa aatgacatal aaiatgaaac 47040 
actatgettt tattttagag catcaccaaa 47100 
taaaatattc ataattttcc tttegtcaca 47160 
tccaactaga caatccaaaa aatagagaaa 47220 
aatcttatct caatcccgcg ggaagctgee 47280 
ccgccgctca eggggaaega tgtegctget 47340 
cgctgcttgg gagataggat atggagagag 47400 
cccattcgta tettcagega caeggaggeg 47460 
agaacgcctc catcttcagg ccgcgcatgc 47520 
aacaaaegta gtcagaggag gagttggagg 47580 
atagtaaatg gactgaaaat ggeccataga 47640 
gaaaaggtgt ttatattatc aaaattagaa 47700 
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attaagctcc gaaaatttta aaaaatattc 
taaaaattaa atccaaccat atcatattat 
tgttgtcgct tacggtggga gagaagggac 
ctcccatgga actttaaaat aaatataatc 
ttttttcccc aagagcgccg cgcagaagag 
ccggccgatc tctcattctc tccacgccct 
ccatctcctc c t tcccc tec cctctatcct 
acaaactgeg ttgcggcgtt ggtttccgcc 
gggcatggcg cgccgcgccg cttcccgcgc 
ecaagggega ggaggccgcg eggggggcag 
egaattgetc cggcgtggca ggggcgcctc 
cglcgcgcgt cacagccccg cggccgccgt 
cgccgacgag gtaactcccg acttgtgcac 
cgcgggccgc ttggacctcg gttlcgcggc 
agtggaagcc alcaccttca ctcctctgct 
egaegcaatg gacatagtgc teegcagaat 
ctcctacaat aatct tc tea acgggctgtg 
gttgetgeae algatggctg atgatcgagg 
taccactgtc atcaatggct tcttcaaaga 
ccatgaaatg ctggaccggg ggattttacc 
tgcgttatgc aagggtcaag ctatggacaa 
aatggtgtca tgcctgattg catgacatat 
gcatttgtct ggacactatt ttctttaatt 
gggttataga atclgaaaaa ctctttgacc 
tcattacata cagtacactc atcgatggat 
tgaagttact t tctggcatg gtctcagttg 
ctttgattaa tggctactgc aaaattagta 
agatggagag cagtggtgtt agtcctgata 
tatttcaaac cagaagaact getgetgeaa 



aaagagcatt at taatcatg attaatttaa 47760 
ttcacggcgc gcagtaggaa aatgegcage 47820 
attgtttatt ttcagaacta tcttttataa 47880 
attattatag cattagtttt tttctgtctt 47940 
ategatcgeg atctccctgc cccgacgtcg 48000 
gctcgtcgcc gatctcctac accatccctg 48060 
ccactggtgc cgcccacctc teegtataag 48120 
ggcgctgctg ctgcacctgt cagctagggc 48180 
tgltggcgcc cttcgctcgg aeggctcgat 48240 
tggegecgag gacgcacgcc acgtgttcga 48300 
gatclacggc ttgaaccgcg ccctcgccga 48360 
gtcccgctac aacegcatgg cccgagctgg 48420 
etaeggcatt etcateggtt gctgctgccg 48480 
cttgggcaat gtcattaaga agggatttag 48540 
caagggcctc tgtgccgaca agaggacgag 48600 
gaccgagctc ggttgcatac caaatgtctt 48660 
tgatgagaac agaagecaag aagctctcga 48720 
aggaggtagc ccacctgatg (ggtgtcgta 48780 
gggggatlca gacaaagctt acagtacata 48840 
tgatgtlgtg acctacagct etattattge 48900 
gecatggagg tacttaccac gatggttaag 48960 
aatagttatt tcttgaaatg ttggatcgag 49020 
caataat tga cagtcattgc aaagaaggga 49080 
tgatggtacg tattggtgtg aagectgata 49140 
attgettgge aggtaagatg gatgaagcaa 49200 
ggttgaaacc taatactgtt acltatagca 49260 
ggatggaaga cgcgttagtt ctttttaagg 49320 
ttattacgta taacataatt ctgcaagglt 49380 
aagaactcta tgtcaggatt accgaaagtg 49440 
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gaacgcagat tgaacttagc acatacaaca 
tcactgatga tgcacttcag atgtttcaga 
ctaggacttt caacattatg attgatgcat 
aggatttgtt tgttgctttc tcgtctaacg 
tgatggctga aaatattata ggacaggggt 
caatggagga caatggctgt actgttgact 
tgttgcagag aggtgagata accagggctg 
acttttccct cgaagcatcc actgcttcct 
atcaagaata ttataggttt ctccctgaaa 
gctgaagcat tttgcagctt tgaaattctg 
tagaggaggg atcttctctg tatgtgtaaa 
tatttttact gtggttccta gaclgtaaac 
aaaaacataa aagtttgtcg ttatctctac 
gtttcaaact taatgtgtct aggcagtaat 
gtttcattaa atttcatcta tgtgtactgt 
aactgaacaa aagatatgtc tgaagctttg 
aatatatttt ttatggaaca agattggatt 
atctcaacgt attattatgc acatgtacta 
tatttacctg tgtgaggcag caaggagtga 
cagaatataa gaagttttag agttggacac 
gtggagggtt gtgattgcat gagtagtgga 
tgtgattggt tgggaagaga atgttggtag 
ttattctaga acaatactgt tgtgctcaag 
ctcgatgggt tttgagctta atcctgggac 
acatggaata aaggatgaca gcatgatttc 
gataatagct gtaaatctat actactttaa 
ctgccaccac cccaccacca actctcaaaa 
tctccaagac atgtgggatc actgtcaatc 
gtggaaatca cggacagacc atggagctct 



taatccttca tggactttgc aaaaacaaac 49500 
acctatgttt gatggatttg aagcttgagg 49560 
tgcttaaagt tggcagaaat gatgaagcca 49620 
gtttagtgcc gaattattgg acgtacaggt 49680 
tgctagaaga attggatcaa ctctttcttt 49740 
ctggcatgct aaatttcatt gttagggaac 49800 
gcacttacct ttccatgatt gatgagaagc 49860 
tgtttataga Icttttgtct gggggaaaat 49920 
aatacaagtc clttalagaa (ctttgagct 49980 
tgttggaatt cttttctcct acagtcctat 50040 
tagcgaggta tgtatgccac ctctccgaat 50100 
aagcaattat gttatgctgt tgatgccaga 50160 
taacggalca taaagggatt Igtgactgga 50220 
tttgacatta gatccaaaac aatttatagg 50280 
ttaggtgttg aatagtttga cttgtttttt 50340 
ttctttacca aatgcagtac tgatcatcac 50400 
gtatagaatg gtttccgatc tgattatctt 50460 
atcatgaaat atctgatgga atgatgtttc 50520 
gatggataac accacatact ccctctatcc 50580 
gattattaag aaagtaggta gaagtgagta 50640 
ggtaggtggg aaaagtgaat ggtggagggt 50700 
agaagttgtt atattttggg gagtacatta 50760 
aagcgttcca aagatgtttc acaacctgtg 50820 
attcagtatc atgatctgtc tcattcttaa 50880 
tttgtctcta taatcttttg gctacccaca 50940 
aaggagtagt ggtggtggtg agtggtgaat 51000 
ttctgacatg tgggatcacl gtcaatccct 51060 
ccttctccaa accaatlgta tgatagaaca 51120 
caaccataat calcctlgcg agtlaataac 51180 
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aaatggagcg taaacttggc aagcaaaaaa ctcaaattaa ttctaaaatt aagctctagg 51240 
attcaaaata gatttcctct ctgcattgtg ctgttatgat ttttaattcc gtaacaacgc 51300 
aaatgcattt tgctagtctt ataaagaagg gttaatgcaa atattctgat taaatgattg 5 i 360 
tatctatgaa gtttgaatgc tagtggaagc tcctttgacc atgttttgtt gtgcgagcat 51420 
ttaagagagt gaagagaatg cttctttggt gctgltctgg tatggaagga tccacagata 51480 
aaattcaggt tctactgctt ctctgcttgt aattttcatg aagctgcagt gaataccttg 51540 
ttgaccactt gatctgttgc tttgaaggag aatatagtag tggccaaggt tggtgacggt 51600 
gatggtggca tgtgatcccc cagatcttca gtgacccaga gaggagggga cggcgcgtgg 51660 
tgagctacaa ggcatactca gtggagggca. agatcaaggc ctcccgtccg taggggactc 51720 
cgctgcatca aggccaactg ctccgaactg atcaattlct ggtacggatc acttctcctt 51780 
tcctttttlt tttcacctta agcactctct tgattcttcg ctgctacctc ccttaatttc 51840 
tttcaatala ttgtggcact tgalcatggc ggagacccac cttccagtgt gaatggaltt 51900 
tgtcaaagaa ctaaatttat tccattagct tattttctga ttacatggaa gacattctlt 51960 
tctggaataa atacagaact aaatcctgtt tcctgaataa aagttgttag tgtgtggcat 52020 
ggtgcatttc cgcgcttcta aattttataa aacctgttca ttcaatttga acctgcatcc 52080 
aatccaatat tttaggtgca gacaggtgct tgcggtcagg ttaaagaagt tggcaaaaat 52140 
gcttctgaag aaaggttaat tgttgtttca tctcaggagg taatatgcag atgattattc 52200 
caattggcat tgccttgcca tttttatcac gagtctttac aattttatat cctcctacat 52260 
attctttcca gattccagat gatccagtgt ctccaacaat tgaggcgctt attttgctcc 52320 
atagtaaagc aagtacactt gctgagaacc accagttgac aacacggctt gttgtaccat 52380 
caaacaaagt tggttgtatt cttggggaag gtggaaaggt aattactgaa atgagaagac 52440 
ggactggggc tgaaalccga gtctactcaa aagcagataa acctaaglac clgtcttttg 52500 
atgaggagct tgtgcaggta alttatttgg ccatacctac accagagatc catatat tac 52560 
ttttataact gcagttttta cttgttaaca tttcaltgtg cttttacatt tgttccaagc 52620 
tttcaggttg ctgggcttcc agclattgaa agaggagccc Igacagagat tgcttcgagg 52680 
ctttgaacta ggacactcag agatggaagt tctlccaata atccgacacc tlttgcccct 52740 
gttgatggtc ctcctgttga tatcttgcct aacaaggaat tcatgctata tggacgatct 52800 
gctaatagtc ccccatatgg agggcctgct aatgatccac catatggaag acctgccatt 52860 
gatccaccat atggaagacc aatatccaca atatggaaga cctgccaatg atccaccata 52920 



EP 1 437 409 A1 



tagaagacct gtcaatgata catcatattg agggttggac aatgatgggc ctcgtgatca 52980 
ggcccggtcc tgaggggggt cgaatggggc gatcgctccg ggcccccgat tcccagggcc 53040 
cccacctatc igtgcaacga gtagtagcga tcttccagcg cgcaacgtga ggcgatgttt 53100 
ctccgtgatt tcgccggcct gcaactgcga gatcgcgagt ataacgatca gccgatcgat 53160 
ctcatctgcc gactgccatg ctgatgccac acgcaagcgc agcatatcag ccttatcttg 53220 
gttgatcggc atgctggacg agcacatctg ttgtcgcatc aactgctgac Igctatatat 53280 
gtgctggtgc tgaatcgatc gattgtcgtc gcggaagtga agaacaacca cggcactgct 53340 
gcctgctggg ctctagccgc catcagtaag tacgctatac tgcctatcta gatctagatc 53400 
gagattacat agtggaatta tctgtttata acaaaattac aaggtatcaa ttgataattt 53460 
aaggttataa ccgtacaaac ttcagtgatt tgclggtttc acattggtta gattigtttc 53520 
aactaatttg gtacttclgt agccttgtaa tttacgaatc tagtattaat attttcttaa 53580 
gtattagcct gttccttgat attatgctgt tgagaaagta tgcaatagat aacaaaaaca 53640 
agtaggtgtg ttgaggatgc tcaagagtaa tacagccact tcaataattc tgatattatc 53700 
aggacatcat caataattct gcgcctacaa atcttcaaag aaaattttaa tataatgcgt 53760 
atgatttttt aaatacgaat attgattgct atttaaagat atttatatta tatggtaatt 53820 
attatttgaa ggtttataat aaaggcctcc gtttttagtt tcacgctggg ccttcagaat 53880 
ctcaggaccg gccctgctca tgatc 53905 

<210> 29 

<211> 24 

<212> DNA 

<2I3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 29 

atcaggagcc ttcaaattgg gaac 24 

<210> 30 
<211> 24 
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<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
<400> 30 

ctcgcaaatt gcttaatttt gacc 24 



<210> 31 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 31 

tgaaggagtt atgggtgcgt gacg 24 

<210> 32 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 32 

ttgccgagca cacttgccat gtgc 24 



<210> 33 

<211> 24 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> Oligonucleotide primer for amplification 
<400> 33 

gcgacgcaat ggacatagtg ctcc 24 

<2l0> 34 
<211> 24 
<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 34 

ttacctgcca agcaatatcc atcg 24 

<210> 35 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 35 

aaggcatact cagtggaggg caag 24 

<210> 36 

<2ll> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
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<400> 36 

ttaacctgac cgcaagcacc tgtc 24 

<210> 37 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 37 

tggatggact atgtggggtc agtc 24 

<210> 38 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 38 

agtggaagtg gagagagtag ggag 24 

<210> 39 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 39 

ccctccaaca cataaatggt tgag 24 
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<2I0> 40 
<21l> 24 
<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
<400> 40 

tttctgccag gaaactgtta gatg 24 

<2I0> 41 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 41 

gcgatcttal acgcatacta tgcg 24 

<210> 42 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 42 

aaagtctttg ttccttcacc aagg 24 



<210> 43 
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<211> 26 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
<400> 43 

gaggatttat caaaacagga tggacg 26 

<210> 44 
<2ll> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 44 

tgggcggcag cagtggagga taga 24 

<210> 45 

<211> 24 

<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 45 

aagaagggag ggttatagaa tctg 24 

<2l0> 46 

<2ll> 24 

<2l2> DNA 
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<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 46 

atatcaggac taacaccact gctc 24 

<210> 47 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 47 

acgagtagta gcgatcttcc agcg 24 

<210> 48 

<2U> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 48 

cagcgtgaaa ctaaaaacgg aggc 24 

<210> 49 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 
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<223> Oligonucleotide primer for amplification 
<400> 49 

atcccacatc atcataatcc gacc 24 

<210> 50 
<211> 25 
<2I2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
<400> 50 

agcttctccc ttggatacgg tggcg 25 

<210> 51 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 51 

atttgttggt tagttgcggc tgag 24 

<210> 52 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 52 
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gcccaaactc aaaaggagag aacc 24 

<210> 53 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 53 

cctcaagtct cccctaaagc cact 24 

<210> 54 

<211> 24 

<2I2> DNA 

<213> artificial sequence 
<220> 

<223> 01 igonucleot id.e primer for amplification 

<400> 54 

gctctactgc tgataaaccg tgag 24 

<2lO> 55 

<2ll> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 55 

tggatggact atgtggggtc agtc 24 
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<210> 56 
<21l> 24 
<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
<400> 56 

agtggaagtg gagagagtag ggag 24 

<2l0> 57 

<21l> 24 

<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 57 

tacgacgcca tttcactcca ttgc 24 

<210> 58 

<211> 24 

<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 58 

catttctcta tgggcgttgc tctg 24 



<210> 59 
<211> 26 
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<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amol i f i cat ion 
<400> 59 

acctgtaggt atggcacctt caacac 26 

<210> 60 

<21l> 26 

<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 60 

ccaaggaacg aagttcaaat gtatgg 26 

<210> 61 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 61 

tgatgtgttt gggcatccct ttcg 24 

<210> 62 

<211> 24 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> Oligonucleotide primer for amplification 

<400> 62 

gagatagggg acgacagaca cgac 24 

<210> 63 

<21l> 26 . 

<2l2> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 63 

tcctatggct gtttagaaac tgcaca 26 

<2lO> 64 

<211> 24 

<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 64 

caagttcaaa cataactggc gttg 24 

<2l0> 65 

<211> 24 

<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 
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<400> 65 

cactgtcctg taagtgtgct gtgc 24 

<2l0> 66 

<2ll> 24 

<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 66 

caagcgtgtg ataaaatgtg acgc 24 

<210> 67 

<211> 24 

<2l2> DNA 

<213> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 67 

tgcctactgc cattactalg tgac 24 

<2lO> 68 

<211> 24 

<212> DNA 

<2l3> artificial sequence 
<220> 

<223> Oligonucleotide primer for amplification 

<400> 68 

acatactacc gtaaatggtc tctg 24 
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Claims 

1 . A method for restoring rice fertility comprising introducing a nucleic acid into rice, wherein the nucleic acid has the 
base sequence of SEQ ID NO.27, or has a base sequence which is identical to at least 70% of the base sequence 
of SEQ ID N0.27, and which functions to restore fertility. 

2. A method for restoring rice fertility comprising introducing a nucleic acid into rice, wherein the nucleic acid has the 
base sequence of bases 38538-54123 of SEQ ID NO.27, or has a base sequence which is identical to at least 
70% of the base sequence of bases 38538-54123 of SEQ ID N0.27, and which functions to restore fertility. 

A method for restoring rice fertility comprising introducing a nucleic acid into rice, wherein the nucleic acid has the 
base sequence of bases 42132-48883 of SEQ ID NO.27, or has a base sequence which is identical to at least 
70% of the base sequence of bases 42132-48883 of SEQ ID N0.27, and which functions to restore fertility. 

The method of any one of Claims 1-3, wherein the nucleic acid having a base sequence which is identical to at 
ieast 70% of the base sequence of SEQ ID NO.27 or of the base sequence of bases 38538-541 23 of SEQ ID NO. 
27, and which meets at least one of the following requirements 1) and 2): 

1) a base corresponding to the base 45461 of SEQ ID NO.27 is A; and 

2) a base corresponding to the base 49609 of SEQ ID NO.27 is A. 

The method of any one of Claims 1 -4, wherein the nucleic acid comprising the rice restorer gene locus introduced 
into rice, does not comprise any constitutional gene other than the rice restorer gene. 

A method for discerning whether or not a subject rice individual or a seed thereof has the rice restorer gene (the 
Rf-1 gene) or not, wherein the method utilizing a fact that a sequence determining the presence of the function of 
the Rf-1 gene positions between the polymorphism detection marker loci P4497 Mbol and B56691 Xba I on rice 
chromosome 10. 

The method of Claim 6 wherein the Rf-1 gene exists in a nucleic acid having the base sequence of SEQ ID NO. 
27, or a nucleic acid having a base sequence which is identical to at least 70% of the base sequence of SEQ ID 
NO.27. 

The method of Claim 6 wherein the Rf-1 gene exists in a nucleic acid having the base sequence of bases 
38538-54123 of SEQ ID NO.27, or a nucleic acid having a base sequence which is identical to at least 70% of the 
base sequence of bases 38538-54123 of SEQ ID NO.27. 



9. The method of claim 6 wherein the Rf-1 gene exists in a nucleic acid having the base sequence of bases 
42132-48883 of SEQ ID NO.27, or a nucleic acid having a base sequence which is identical to at least 70% of the 

^0 base sequence of bases 42132-48883 of SEQ ID NO.27. 

10. The method of any one of claims 7-9 wherein the subject rice individual or the seed thereof is determined to have 
the Rf-1 gene, in the case that the nucleic acid having a base sequence which is identical to at least 70% of the 
base sequence of SEQ ID NO.27 or of the base sequence of bases 38538-541 23 of SEQ ID NO.27, meets at least 

45 one of the following requirements 1) and 2): 

1) a base corresponding to the base 45461 of SEQ ID NO.27 is A; and 

2) a base corresponding to the base 49609 of SEQ ID NO.27 is A. 



11. The method of any one of Claims 7-10, wherein the method comprises: 

i) preparing a pair of primers based on a base sequence of adjacent regions including any one of the following 
base) 



1 ) a base corresponding to the base 45461 ; or 

2) a base corresponding to the base 49609; to amplify both the base of the above and adjacent reqions 
thereto; 
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ii) performing nucleic acid amplification reaction(s) using the genome DNA of the subject rice individual or the 
seed thereof as a template; and 

iii) discerning the presence of the Rf-1 in the subject rice individual or the seed thereof based on polymorphism 
found in said nucleic acid amplification product. 

12. The method of Claim 11 wherein the subject rice individual or the seed thereof is determined to have the Rf-1 
gene, in the case that the step iii) meets least one of the following requirements 1) and 2): 

1) a region including a base corresponding to the base 45461 of SEQ ID N0.27 does not have any Taql 
recognition sequence; and 

2) a region including a base corresponding to the base 49609 of SEQ ID NO. 27 does not have any BstU 
recognition sequence. 

13. The method of Claim 11 or 12 using a pair of primers having base sequences selected from the group consisting 
of SEQ ID NO:45 and 46, and SEQ ID NO:47 and 48. 

14. A method for inhibiting the function of the Rf-1 gene to restore fertility by introducing an antisense having at least 
100 continuous bases in length, and having a base sequence complementary to a nucleic acid having the base 
sequence of SEQ ID NO.27, or to a nucleic acid having a base sequence which is identical to at least 70% of the 
base sequence of SEQ ID NO.27, and which functions to restore fertility. 

15. A method for inhibiting the function of the Rf-1 gene to restore fertility by introducing an antisense having at least 
1 00 continuous bases in length, and being selected from base sequences complementary to a nucleic acid having 
the base sequence of bases 38538-54123 of SEQ ID NO.27, or to a nucleic acid having a base sequence which 
is identical to at least 70% of the base sequence of bases 38538-54123 of SEQ ID NO.27, and which functions to 
restore fertility. 

16. A nucleic acid having the base sequence of SEQ ID NO.27, or a nucleic acid having a base sequence which is 
identical to at least 70% of the base sequence of SEQ ID NO.27, and which functions to restore fertility. 

1 7. A nucleic acid having the base sequence of bases 38538-541 23 of SEQ ID NO.27, or a nucleic acid having a base 
sequence which is identical to at least 70% of the base sequence of bases 38538-54123 of SEQ ID NO.27, and 
which functions to restore fertility. 

1 8. A nucleic acid having the base sequence of bases 421 32-48883 of SEQ ID NO.27, or a nucleic acid having a base 
sequence which is identical to at least 70% of the base sequence of bases 42132-48883 of SEQ ID NO.27, and 
which functions to restore fertility. 
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